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HUBR-1205 (10201511) 
IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 
Applicant(s) : Tischer, et al. 

Serial No. : 10/049,750 

Filed : February 15, 2002 



For 



ENZYMATIC SYNTHESIS OF 
DEOXYRIBONUCLEOSIDES 



Commissioner of Patents 
and Tradernarks 
Washington, D.C. 20231 



Sir: 



December 9, 2002 



PRELIMINARY AMENDMENT 



In advance of prosecution, please amend the above-identified patent application as 



follows: 



IN THE CLAIMS 



Cancel claims 3-25, 28-31, 34-36, and 39-41 without prejudice. 
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REMARKS 



This preliminary amendment is the same amendment as that submitted on February 15, 
2002, and that amendment also included the statement regarding verification of the computer 
diskette/sequence listing. Apparently the sequence listing and diskette were indicated as being 
received, but the amendments to the claims were not entered. An IDS was also submitted on that 
date, and it is not certain if these were entered. Acknowledgement of receipt of these documents 
is respectfully requested. 



It is not believed that any additional claim fees are due in view of this amendment, but 
any fees that may be due may be charged to deposit account no. 50-0624. 



A supplemental preliminary amendment will follow. 



Respectfully submitted. 



FULBRIGHT & JAWORSKI, L.L.P. 



666 Fifth Avenue 

New York, New York 10103 

(212)318-3148 



By V ^tA^-^ ' ^'^■'^^ 

jf<> James R. Crawford 
// Reg. No, 39,155 
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Enzymatic synthesis of deoxyrtbonucleosides 

Description 

The present invention relates to a method for the in vitro enzymatic 
synthesis of deoxyribonucleosides and enzymes suitable for this method- 
Natural deoxyribonucleosides (deoxyadenosine, dA; deoxyguanosine, dG; 
deoxycytidine, dC and thymidine, dT) are building blocks of DNA. The N- 
giycosidic bond between nucleobase and sugar involves the of a 
pyrimidine or the Ng of a purine ring and the of deoxyribose. 
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In the living cells the four deoxyribonucleosides (dN) result from the 
"salvage pathway" of riucleotide metabolism. A group of enzymes is 
involved in cellular catabolism of deoxyribonucleosides. Besides 
deoxyriboaldolase (EC 4.1.2.4) and deoxyribomutase (EC 2.7.5.1), this 
group also includes thymidine phosphorylase (EC 2.4.2.4) and purine 
nucleoside phosphorylase (EC 2.4. 2. 1 ) . These four enzymes are induced by 
the addition of deoxyribonucleosides to the growth medium. The genes 
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coding for these enzymes have been shown to map closely together on the 
bacterial chromosome (Hammer-Jesperson and Munch-Peterson, 
Eur.j;Biochem.l7 (1970), 397 and literature cited therein). In E.coli the 
genes as described above are located on the deo operon which exhibits an 
unusual and complicated pattern of regulation (Valentin-Hansen et al., 
EMBO J.I (1982), 317). 

Using the enzymes of the deo operon for synthesis of deoxynucleosides 
was described by C.F.Barbas III (Overproduction and Utilization of Enzymes 
in Synthetic Organic Chemistry, Ph.D. Thesis (1989), Texas A&M 
University). He applied phosphopentomutase and thymidine phosphorylase 
for the synthesis of deoxynucleosides. Deoxyribose 5-phosphate was 
prepared by chemical synthesis (Barbas III et al., J.Am.Chem.Soc. 112 
(1990), 2013-2014), which makes this compound expensive as starting 
material and not suitable for large scale synthesis. He also made 
deoxyriboaldolase available as a recombinant enzyme and investigated its 
synthetic applicability but neither he nor C.-H.Wong (Microbial Aldolases in 
Carbohydrate Synthesis: ACSSymp.Ser.No.466:.Enzymesin Carbohydrate 
Synthesis, Eds. M.D.Bednarski, E, S.Simon (1991), 23-27) were able to 
carry out a coupled one-pot synthesis employing all three enzymes. It 
appears likely that some drawbacks exist which could not be circumvented. 
Among these drawbacks are insufficient chemical equilibrium, instability of 
intermediates, such as deoxyribose 1 -phosphate and inactivation and 
inhibition effects of involved compounds on the enzymes. 

Evidence of an advantageous equilibrium is given by S.Roy et al. (JACS 1 08 
(1 986), 1 675-78). For the aldolase reaction the equilibrium is on the desired 
product side (deoxyribose 5-phosphate), for the phosphopentomutase it is 
on the wrong side (also deoxyribose 5-phosphate) and for the purine 
nucleoside phosphorylase it is on the desired synthesis product side. The 
authors suggest coupling of the three enzyme reactions to obtain reasonable 
yields. Contrary to these suggestions they prepared deuterated 
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deoxyguanosine and thymidine in a two step procedure, that is deoxyribose 
5-phosphate in a first step and deoxynucleoside in a second step. Isolated 
yields of the second step were 11% and 5% for deoxyguanosine and 
thymidine, respectively. These low yields are also obtained in the 
preparation of arabinose-based nucleosides (Barbas III (1990), supra). 
These low yields indicate serious drawbacks for the use of the enzymes of 
the deo operon in a synthetic route which have to work in the reverse 
direction of their biological function, which is degradation of 
deoxynucleosides. 

Thus, there does not exist any economical commercial method at present 
for the enzymatic in vitro synthesis of deoxyribonucleosides. Hitherto, for 
commercial purposes, deoxynucleosides are generated from fish sperm by 
enzymatic cleavage of DNA. This method, however, involves several 
disadvantages, particularly regarding difficulties of obtaining the starting 
material in sufficient quantity and quality. 

Therefore, it was an object of the invention to provide a method, by means 
of which the drawbacks of the prior are eliminated at least partially and 
which allows efficient and economical synthesis of deoxyribonucleosides 
without any dependence on unreliable natural sources. 

Surprisingly, it was found that the drawbacks of previous enzymatic 
synthesis routes can be avoided and deoxyribonucleosides can be obtained 
in high yields of e.g. at least 80% based on the amount of starting material. 

In a first aspect, the present invention relates to a method for the in vitro 
enzymatic synthesis of deoxyribonucleosides comprising reacting 
deoxyribose 1 -phosphate (dRIP) and a nucleobase, wherein a 
deoxyribonucleoside and inorganic phosphate are formed. 
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The reaction is catalyzed by an enzyme which is capable of transferring a 
deoxyribose moiety to a nucleobase, with a deoxyribonucleoside being 
formed. Preferably, the reaction is catalyzed by a thymidine phosphorylase 
(TP, EC 2.4.2.4) or a purine nucleoside phosphorylase (PNP, EC 2.4.2.1). 
For the EC designation of these enzymes and other enzymes mentioned 
below reference is made to the standard volume Enzyme Nomenclature 
1992, Ed. E.C.Webb, Academic Press, Inc. 

These enzymes and other enzymes mentioned below are obtainable as 
native proteins from natural sources, i.e. any suitable organisms selected 
from eukaryotes, prokaryotes and archaea including thermophilic organisms. 
Further, these enzymes are obtainable as recombinant proteins from any 
suitable host cell which is transformed or transfected with a DNA encoding 
said enzyme. The host cell may be a eukaryotic cell, a prokaryotic cell or an 
archaea cell. Particular preferred sources of native or recombinant TP or 
PNP are prokaryotic organisms such as E.coli. Recombinant TP may be 
isolated from E.coli strain pHSP 282 {CNCM 1-21 86) deposited on April 23, 
1999, which is a recombinant E.coli strain transformed with a plasmid 
containing the E.coli deoA (thymidine phosphorylase) insert. Recombinant 
PNP may be isolated from E.coli strain pHSP 283 (CNCM 1-2187) deposited 
on April 23, 1 999, which is a recombinant E.coli strain transformed with a 
plasmid containing the E.coli deoD (purine nucleoside phosphorylase) insert. 
The nucleotide sequence of the TP gene and the corresponding amino acid 
sequence are shown in SEQ ID N0.1 and 2. The nucleotide sequence of the 
PNP gene and the corresponding amino acid sequence are shown in SEQ ID 
NO. 15 and 16 and 3 and 4. 

The nucleobase, to which the deoxyribose unit is transferred, will be 
selected from any suitable nucleobase. For example, the nucleobase may 
be a naturally occurring nucleobase such as thymine, uracil, adenine, 
guanine or hypoxanthine. It should be noted, however, that also non- 
naturally occurring analogs thereof are suitable as enzyme substrates such 
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as 2-thio-uracil, G-aza-uracil, 5-carboxy-2-thiouracil, S-aza-thymine, 6-a2a-2- 
thio-thymine and 2,6-diamino-purine. 

Preferably the inorganic phosphate is removed from the reaction. This 
removal is preferably effected by (i) conversion to inorganic pyrophosphate, 
(ii) precipitation/complexation and/or (iii) substrate phosphorylation. 

Conversion to inorganic pyrophosphate may be effected by a phosphate 
transfer from a phosphorylated, preferably polyphosphorylated substrate 
such as fructose diphosphate (FDP), wherein a phosphate group is cleaved 
from the phosphorylated substrate and reacts with the inorganic phosphate, 
with inorganic pyrophosphate (PPi) being formed. This phosphate transfer 
is preferably catalyzed by a PPi-dependent phosphorylase/kinase, e.g. by a 
PPi-dependent phosphofructokinase {PFK-PPi, EC 2.7.1.90), which catalyzes 
the reaction of fructose diphosphate (FDP) and inorganic phosphate to 
fructose 6-phosphate {F6P) and inorganic pyrophosphate. Preferred sources 
of PPi-dependent kinases/phosphorylases and genes coding therefor are 
from Propionibacterium freudenreichii (shermanii) or from potato tubers. 

Further, the inorganic phosphate may be removed from the reaction by 
precipitation and/or compiexation which may be effected by adding 
polyvalent metal ions, such as calcium or ferric ions capable of precipitating 
phosphate or by adding a complex-forming compound capable of 
complexing phosphate. It should be noted that also a combination of 
pyrophosphate formation and compiexation/ precipitation may be carried 
out. 

Furthermore, the removal of inorganic phosphate may be effected by 
substrate phosphorylation. Thereby the inorganic phosphate is transferred 
to a suitable substrate, with a phosphorylated substrate being formed. The 
substrate is preferably selected from saccharides, e.g. disaccharides such 
as sucrose or maltose. When using disaccharides as substrate, a 
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monosaccharide and a phosphorylated monosaccharide are obtained. The 
phosphate transfer is catalyzed by a suitable phosphorylase/kinase such as 
sucrose phosphorylase (EC 2.4. 1 .7) or maltose phosphorylase (EC 2.4, 1 .8), 
Preferred sources of these enzymes are Leuconostoc mesenteroides, 
Pseudomonas saccherophila {sucrose phosphorylase) and Lactobacillus 
brevis (maltose phosphorylase). 

The phosphorylated substrate may be further reacted by additional coupled 
enzymatic reactions, e.g. into a galactoside (Ichikawa et al., Tetrahedron 
Lett.36 (1995), 8731-8732). Further, it should be noted that phosphate 
removal by substrate phosphorylation may also be coupled with other 
phosphate removal methods as described above. 

Deoxyribose 1 -phosphate (dRIP), the starting compound of the method of 
the invention, is a rather unstable compound, the isolation of which is 
difficult. In a preferred embodiment of the present invention, dIRP is 
generated in situ from deoxyribose 5-phosphate (dR5P) which is relatively 
stable at room temperature and neutral pH. This reaction is catalyzed by a 
suitable enzyme, e.g. a deoxyribomutase (EC 2.7.5.1) or a phosphopentose 
mutase (PPM, EC 5.4.2.7) which may be obtained from any suitable source 
as outlined above. The reaction is preferably carried out in the presence of 
divalent metal cations, e.g. Mn^* or Co^* as activators. Preferred sources 
of deoxyribomutase are enterobacteria. Particular preferred sources of 
native or recombinant PPM are prokaryotic organisms such as E.coli. 
Recombinant PPM may be isolated from E.coli strain pHSP 275 (CNCM I- 
2188) deposited on April 23, 1999, which is a recombinant E.coli strain 
transformed with a plasmid containing the E.coli deo B (phosphopentose 
mutase) insert. The nucleotide sequence of the PPM gene and the 
corresponding amino acid sequence are shown in SEQ ID NO. 1 7 and 1 8 and 
5 and 6. 
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dR5P may be generated by a condensation of glyceraldehyde 3-phosphate 
(GAP) with acetaldehyde. This reaction is catalyzed by a suitable enzyme, 
preferably by a phosphopentose aldolase (PPA, EC 4.1.2.4). The reaction 
exhibits an equilibrium constant favorable to the formation of the 
phosphorylated sugar {K^^ = [dR5P]/[acetaldehyde] x [GAP] = 4.2 x 10^ x 
M'^). PPA forms an unstable Schiff base intermediate by interacting with the 
aldehyde. Particular preferred sources of native or recombinant PPA are 
prokaryotic organisms such as E.coli. Recombinant PPA may be isolated 
from E.coli strain pHSP 276 {CNCM 1-2189) deposited on April 23, 1999. 
This recombinant E.coli strain is transformed with a plasmid containing the 
deoC (phosphopentosealdolase) insert. The nucleotide sequence of the PPA 
gene and the corresponding amino acid sequence are shown in SEQ ID 
NO. 19 and 20 and 7 and 8. 

GAP is a highly unstable compound and, thus, should be generated in situ 
from suitable precursors which are preferably selected from fructose 1,6- 
diphosphate (FDP), dihydroxyacetone (DHA) and/or glycerolphosphate (GP), 
with FDP being preferred. 

FDP can be converted by an FDP aldolase (EC 4.1.2.13) selected from FDP 
aldolases 1 and FDP aldolases II to GAP and dihydroxyacetone phosphate 
(Ke^ == [FDPl/IGAP] x IDHAP] = 1 0^M"*). The two families of FDP aldolases 
giving identical end products (GAP and DHAP) via two chemically distinct 
pathways may be used for this reaction. FDP aldolase I forms Schiff base 
intermediates like PPA, and FDP aldolase II which uses metals (Zn^"*") 
covalently bound to the active sites to generate the end products. FDP- 
aldolase I is characteristic to eukaryotes, although it is found in various 
bacteria. FDP-aldolase II is more frequently encountered in prokaryotic 
organisms. If FDP-aldolase reacts with FDP in the presence of acetaldehyde, 
the latter compound can interact with DHAP to yield an undesired 
condensation by-product named deoxyxylolose 1 -phosphate (dXIP). Thus, 
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the reaction is preferably conducted in a manner by which the generation 
of undesired side products is reduced or completely suppressed. 

Particular preferred sources of native or recombinant FDP aldolases are 
prokaryotic or eukaryotic organisms. For example, FDP aldolase may be 
isolated from rabbit muscle. Further, FDP aldolase may be obtained from 
bacteria such as E.coli. Recombinant FDP aldolase may be isolated from 
recombinant E.coli strain pHSP 284 (CNCM 1-2190) which is transformed 
with a plasmid containing the E.coli fba (fructose diphosphate aldolase) 
insert. The nucleotide sequence of the E.coli FDP aldolase gene and the 
corresponding amino acid sequence are shown in SEQ ID NO, 9 and 10. 

On the other hand, GAP may be generated from DHAP and ATP, with 
dihydroxyacetone phosphate (DHAP) and ADP being formed and 
subsequent isomerization of DHAP to GAP in a reaction catalyzed by a 
glycerokinase (GK, EC 2.7. 1 .30) and a triose phosphate isomerase (TIM, EC 
5.3.1 .1). Suitable glycerokinases are obtainable from E.coli, suitable triose 
phosphate isomerases are obtainable from bovine or porcine muscle. 

In a still further embodiment of the present invention GAP may be generated 
from glycerol phosphate (GP) and Oj, with DHAP and HjO^ being formed 
and subsequent isomerization of DHAP to GAP in a reaction catalyzed by 
a glycerophosphate oxidase (GPO, EC 1.1.3.21) and a triose phosphate 
isomerase (TIM, EC 5.3.1.1). Suitable glycerophosphate oxidases are 
obtainable from Aerococcus viridans. 

In an alternative embodiment of the present invention deoxyribose 5- 
phosphate (dRSP) is generated by phosphorylation of deoxyribose. 
Preferably this reaction is carried out in the presence of a suitable enzyme, 
e.g. a deoxyribokinase (dRK, EC 2.7.1.5) which may be obtained from 
prokaryotic organisms, particularly Salmonella typhi and in the presence of 
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ATP. The nucleotide sequence of the Salmonella dRK gene and the 
corresponding amino acid sequence are shown in SEQ ID N0.1 1 and 12. 

By the reaction as outlined above deoxyribonucleosides are obtained which 
contain a nucleobase which is accepted by the enzymes TP and/or PNP. TP 
is specific for thymidine (T), uracil (U) and other related pyrimidine 
compounds. PNP uses adenine, guanine, hypoxanthine or other purine 
analogs as substrates. 

The synthesis of deoxyribonucleosides which are not obtainable by direct 
condensation such as deoxycytosine (dC), thus, require an additional 
enzymatic reaction, wherein a deoxyribonucleoside containing a first 
nucleobase is reacted with a second nucleobase, with a second 
ribonucleoside containing the second nucleobase being formed. The second 
nucleobase is preferably selected from cytosine and analogs thereof such 
as 5-azacytosine. It should be noted, however, that also other nucleobases 
such as 6-methyl purine, 2-amino-6-methylmercaptopurine, 6- 
dimethylaminopurine, 2,6-dichloropurine, 6-chIoroguanine, 6-chloropurine, 
6-azathymine, 5-fluorouracil, ethyl-4-amino-5-imidazole carboxylate, 
imidazole-4-carboxamlde and 1,2,4-trla2ole-3-carboxamide may be 
converted to the corresponding deoxyribonucleoside by this nucleobase 
exchange reaction (Beaussire and Pochet, Nucleosides & Nucleotides 14 
(1995), 805-808, Pochet et al., Bioorg.Med.Chem.Lett-5 (1 995), 1679- 
1684, Pochet and Dugue, Nucleosides & Nucleotides 17 (1998), 2003- 
2009, Pistotnik et aL, Anal.Biochem.271 (1999), 192-199). This reaction 
is preferably catalyzed by an enzyme called nucleoside 2- 
deoxyribosyltransferase (NdT, EC 2.4.2.6) which transfers the glycosyl 
moiety from a first deoxynucleoside to a second nucleobase, e.g. cytosine. 
A preferred source of native or recombinant NdT are prokaryotic organisms 
such as lactobacilli, particularly Lactobacillus leichmannii. Recombinant NdT 
may be isolated from recombinant E.coli strain pHSP 292 (CNCM 1-2191) 
deposited on April 23, 1999, which is transformed with a plasmid 
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containing the LJeichmannit NdT (nucleoside 2-deoxyribosyltransferase) 
insert. The nucleotide sequence of the NdT gene and the corresponding 
amino acid sequence are shown in SEQ ID NO. 13 and 14. 

A further aspect of the present invention is a method for the in vitro 
enzymatic synthesis of deoxyribonucleosides comprising the steps of: (i) 
condensing glyceraldehyde 3-phosphate (GAP) with acetaldehyde to 
deoxyribose 5-phosphate (dRBP), (ii) isomerizing deoxyribose 5-phosphate 
to deoxyribose 1 -phosphate (dRIP) and (iii) reacting deoxyribose 1- 
phosphate and a nucleobase, wherein a deoxyribonucleoside and inorganic 
phosphate are formed. Preferably, the reaction is carried out without 
isolating intermediate products and, more preferably, as a one-pot reaction. 
Further, the removal of the inorganic phosphate from the reaction is 
preferred. 

As outlined above, the glyceraldehyde 3-phosphate may be generated from 
FDP, DHA and/or GP. Preferably, FDP is used as a starting material. 

In order to avoid the production of undesired by-products and the toxic 
effects of acetaldehyde, the course of the reaction is preferably controlled 
by suitable means. Thus, preferably, the reaction is carried out in a manner 
such that the acetaldehyde concentration in step (ii) is comparatively low, 
e.g. less than 100 mM, particularly less than 50 mM, e.g. by adding the 
acetaldehyde in portions or continuously during the course of the reaction 
and/or by removing excess acetaldehyde. Further, it is preferred that before 
step (ii) excess starting materials and/or by-products, particularly fructose 
1,6-diphosphate and/or deoxyxylulose 1 -phosphate (dXIP), are removed. 
This removal may be effected by chemical and/or enzymatic methods, e.g. 
precipitating FDP with ferric salts or enzymatically degrading XI P via 
dihydroxyacetone phosphate. Alternatively or additionally the reaction 
conditions may be adjusted such that before step (ii) no substantial 
amounts, preferably less than 10 mM, of starting materials and/or by- 
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products, particularly fructose 1 ,6-diphosphate and/or deoxyxylulose 1- 
phosphate, are present in the reaction mixture. 

In still another embodiment, the present invention relates to a method for 
the in vitro enzymatic synthesis of deoxyribonucleosides comprising the 
steps of: (i) phosphorylating deoxyribose to deoxyribose 5-phosphate, (ii) 
isomerizing deoxyribose 5-phosphate to deoxyribose 1 -phosphate and (iii) 
reacting deoxyribose 1 -phosphate and nucleobase, wherein a 
deoxyribonucleoside and inorganic phosphate are formed. Preferably, these 
reactions are carried out with isolating intermediate products and, more 
preferably, as a one-pot reaction. To obtain a better yield the removal of 
inorganic phosphate from step (iii) is preferred. 

By the process as described above naturally occurring deoxyribonucleosides 
such as dA, dG, dT, dU and dT but also analogs thereof containing non- 
naturally occurring nucleobases and/or non-naturally occurring deoxyribose 
sugars such as 2'-deoxy-3'-azido-deoxyribose or 2'-deoxy-4'-thio-deoxy- 
ribose may be produced- 

The deoxyribonucleosides obtained may be converted to further products 
according to known methods. These further reaction steps may comprise 
the synthesis of deoxyribonucleoside mono-, di- or triphosphates, of H- 
phosphonates or phosphoramidites. Additionally or alternatively, labelling 
groups such as radioactive or chemical labelling groups may be introduced 
into the deoxyribonucleosides. 

Still a further aspect of the present invention is the use of an isolated 
nucleic acid molecule encoding a nucleoside 2-deoxyribosyl transferase 
(NdT, EC 2.4.2.6) for the preparation of an enzyme in an in vitro enzymatic 
synthesis process, wherein a deoxyribonucleoside containing a first 
nucieobase is reacted with a second nucleobase, with a 
deoxyribonucleoside containing the second nucleobase being formed. The 
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second nucleobase is preferably selected from cytidine and analogs thereof, 
2,6-dichloro-purine, 6-chloro-guanine, 6-chloro-purine, 6-aza-thymine and 
5-fluoro-uracil. The first nucleobase is preferably selected from thymine, 
guanine, adenine or uraciL 

More preferably, the nucleic acid molecule encoding an NdT comprises (a) 
the nucleotide sequence shown in SEQ ID NO. 13 or its complementary 
sequence, (b) a nucleotide sequence corresponding to the sequence of (a) 
in the scope of degeneracy of the genetic code or (c) the nucleotide 
sequence hybridizing under stringent conditions to the sequence (a) and/or 
(b). Apart from the sequence of SEQ ID NO. 13 the present invention also 
covers nucleotide sequences coding for the same polypeptide, i.e. they 
correspond to the sequence within the scope of degeneracy of the genetic 
code, and nucleotide sequence hybridizing with one of the above-mentioned 
sequences under stringent conditions. These nucleotide sequences are 
obtainable from SEQ ID NO. 13 by recombinant DNA and mutagenesis 
techniques or from natural sources, e.g. from other Lactobacillus strains. 

Stringent hybridization conditions in the sense of the present invention are 
defined as those described by Sambrook et aL, Molecular Cloning, A 
Laboratory Manual, Cold Spring Harbor Laboratory Press (1989), 1.101- 
1,104, According to this, hybridization under stringent conditions means 
that a positive hybridization signal is still observed after washing for one 
hour with 1 x SSC buffer and 0.1 % SDS at 55 °C, preferably at 62°C and 
most preferred at 68^C, in particular, for one hour in 0.2 x SSC buffer and 
0.1% SDS at 55°C, preferably at 62°C and most preferred at 68°C. 

Moreover, the present invention also covers nucleotide sequences which, 
on nucleotide level, have an identity of at least 70%, particularly preferred 
at least 80% and most preferred at least 90% to the nucleotide sequence 
shown in SEQ ID NO. 13. Percent identity are determined according to the 
following equation: 
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n 

I = X 100 

L 

wherein I are percent identity, L is the length of the basic sequence and n 
is the number of nucleotide or amino acid difference of a sequence to the 
basic sequence. 

Still another subject matter of the present invention is a recombinant vector 
comprising at least one copy of the nucleic acid molecule as defined above, 
operatively linked with an expression control sequence. The vector may be 
any prokaryotic or eukaryotic vector. Examples of prokaryotic vectors are 
chromosomal vectors such as bacteriophages (e.g. bacteriophage Lambda) 
and extrachromosomal vectors such as plasmids (see, for example, 
Sambrook et al., supra. Chapter 1-4). The vector may also be a eukaryotic 
vector, e.g. a yeast vector or a vector suitable for higher cells, e.g. a 
plasmaid vector, viral vector or plant vector. Suitable eukaryotic vectors are 
described, for example, by Sambrook et al., supra. Chapter 16. The 
invention moreover relates to a recombinant cell transformed with the 
nucleic acid or the recombinant vector as described above. The cell may be 
any cell, e.g. a prokaryotic or eukaryotic cell. Prokaryotic cells, in particular, 
E.coli cells, are especially preferred. 

The invention refers to an isolated polypeptide having NdT activity encoded 
by the above-described nucleic acid and its use for the preparation of 
deoxyribonucleosides. Preferably, the polypeptide has the amino acid 
sequence shown in SEQ ID NO. 14 or an amino acid sequence which is at 
least 70%, particulary preferred at least 80% and most preferred at least 
90% identical thereto, wherein the identity may be determined as described 
above. 
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Finally, the present invention also relates to the use of isolated nuclecic acid 
molecules having thymidine phosphorylase (TP), purine nucleoside 
phosphorylase (PNP), phosphopentose mutase (PPM), phosphopentose 
aldolase (PPA), FDP aldolase and deoxyribokinase (dRK) activity for the 
preparation of an enzyme for a method for the in vitro synthesis of 
deoxynucleosides. Preferably, these nucleic acids are selected (a) from a 
nucleotide sequence shown in SEQ ID N0.1, 3, 5, 7, 9 or 1 1 or their 
complementary sequences, (b) a nucleotide sequence corresponding to a 
sequence of (a) within the scope of degeneracy of the genetic code or (c) 
a nucleotide sequence hybridizing under stringent conditions to a sequence 
(a) and/or (b). 

Isolated polypeptides having TP, PNP, PPM, PPA, FDP aldolase or dRK 
activity encoded by the above-described nucleic acids may be used for the 
preparation of deoxyribonucleosides. Preferably, these polypeptides have 
the amino acid sequence shown in SEQ ID NO. 2, 4, 16, 6, 18, 8, 20, 10 
or 12 or an amino acid sequence which is at least 70%, particulary 
preferred at least 80% and most preferred at least 90% identical thereto, 
wherein the identity may be determined as described above. 

An isolated nucleic acid molecule encoding a dRK may be used for the 
preparation of an enzyme for an in vitro method for the enzymatic synthesis 
of deoxyribonucleosides comprising the step of phosphorylating deoxyribose 
to deoxyribose 5-phosphate, wherein said nucleic acid molecule comprises 
(a) the nucleotide sequence shown in SEQ ID N0.1 1 or its complementary 
sequence, (b) a nucleotide sequence corresponding to the sequence of (a) 
in the scope of the degeneracy of the genetic code or (c) a nucleotide 
sequence hybridizing under stringent conditions to the sequence of (a) 
and/or (b). Correspondingly, an isolated polypeptide having dRK activity is 
suitable for an in vitro method for the enzymatic synthesis of 
deoxyribonucleosides as outlined above. 
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The E.coli strains pHSP 282 (CNCM 1-2186), pHSP 283 (CNCM 1-2187), 
pHSP 275 (CNCM 1-2188), pHSP 276 (CNCM 2189), pHSP 284 (CNCM I- 
2190) and pHSP 292 (CNCM 1-2191) were deposited according to the 
regulations of the Budapest Treaty on April 23, 1999 at the Collection 
5 Nationale de Culture de Microorganismes, Institut Pasteur, 25, Rue de 
Docteur Roux, 75724 Paris Cedex 1 5. 

Description of figures 

10 Figure 1 shows the synthesis of dRSP according to Example 12. 

Figure 2 shows the synthesis of deoxyadenosine according to Example 
12. 



15 Figure 3 shows the synthesis of deoxyadensine according to Example 

13. 

Figure 4 shows the synthesis of dG-NHj according to Example 14. 
20 Example 1 

Sources of Enzymes 

L-glycerol 3-phosphate oxidase (1.1.3.21) from Aerococcus viridans, 
sucrose phosphorylase {2,4.1.7), fructose 6-phosphate kinase (2.7.1.90) 
25 from Propionibacterium freudenreichii, rabbit muscle aldolase (RAMA), 
formate dehydrogenase, glyceroiphosphate dehydrogenase (GDH), 
triosephosphate isomerase (TIM), catalase, glycerol 3-phosphate oxidase 
and maltose phosphorylase were obtained from commercial sources (Roche 
Diagnostics, Sigma) or as described in the literature. 

30 

FDP aldolase II (4.1.2.13), phosphopentose aldolase (PPA, EC 4.1.2.4), 
phosphopentose mutase (PPM, EC 5.4.2.7), thymidine phosphorylase (TP, 
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EC 2.4.2,4), purine nucleoside phosphorylase (PNP, EC 2.4.2. 1 ), nucleoside 
2-deoxyribosyl transferase (NdT, EC 2.4.2.6) were obtained from E.coli 
strains deposited at CNCM (see above). 

Example 2 

Protocol of the synthesis of deoxyadenosine 

Reaction mixture A was prepared by adding acetaldehyde (final 
concentration 250 mM), FDP aldolase II (0.5 U/ml), PPA (2.5 U/ml) to 20 
ml of 100 mM fructose-1 ,6-diphosphate (FDP), pH 7.6 and incubating 
overnight at 4°C. 

Mixture B was prepared by adding iVlnCl2 (final concentration 0.6 mM), 
glucose 1 ,6-diphosphate (15/yM), PPM (1 .5 U/ml), PNP (0.4 U/ml), SP (1.5 
U/ml) pentosephosphate aldolase, PPA (2 U/ml) and FDP aldolase 11 (0.5 
U/ml) to 10 ml 0.9 M sucrose, pH 7.6, at room temperature. 

2 ml of A were added over B at a temperature of 20 °C. After 1 hour 2.5 ml 
A were added. After another hour 3.0 ml A were added. After another 1 .5 
h 3.5 ml A were added. After another 1.5 h 4 ml A were added and after 
another 1-1. 5h 5 ml A were added and left to stand overnight. 

At each time of addition of A the amounts of FDP, dR5P, dXIP and dA in 
the reaction mixture were determined and the yield was calculated. The 
concentration of acetaldehyde was kept between 20-30 mM. The results 
are shown in Table 1 : 



wo 01/14566 



PCT/EPOO/08088 



- 17 - 
Table 1 



Time Volume 
(h) {ml) 


Concentrations (mM) 


Yield (mmol) 


dR5P dA dXIP 


dA 


0 12 


4 0 1.2 


0 


1 12 


3.4 3.2 1 


0.04 


2 14.5 


7.9 8.0 2.6 


0.12 


3.5 17.5 


13 16.2 4.3 


0.28 


5 21 


11.7 21.7 


0.46 


6 25 


23.7 


0.59 


22 30 


11 40.4 13.2 


1.21 


30 30 


50.3 


1.51 


54 30 


8.9 60.6 


1.82 



The starting amount of FDP was 1 .92 mmoL The amount after completion 
of reaction was 0.150 mmol. Thus, 1.77 mmol were consumed, 
theoretically corresponding to 3.54 mmol equivalents d A. The amount of dA 
formed was 1 .82 mmol, leading to a yield of 51 .4% based on the amount 
of FDP, 

Example 3 

Removal of excess FDP by means of FeCl3 

1.4 g (2.55 mmol) trisodium-fructose-1 ,6-disphosphate-octahydrate and 
430 fj\ (335 mg, 7.6 mmol) acetaldehyde were dissolved in 1 5 ml of water 
at 4°C. A pH of 7.9 was adjusted by means of sodium hydroxide solution. 
1 50 U pentosephosphate aldolase (PPA) were added, and cold water (4°C) 
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was added to give 20 ml. After addition of 50 U E.coli aldolase 11 the 
mixture was stored at 4^C. After 2 h another 75 U PPA and 50 p\ 
acetaldehyde (390 mg, 8.9 mmol) were added. After 20 h 500 U 
triosephosphate isomerase (TIM) were added. After 120 h the solution 
contained about 68 mM FDP, about 1 2 mM dXI P and about 45 mM dR5P. 
The reaction was stopped by adding 900 //I of a 2 M solution of iron(lll) 
chloride in 0.01 M hydrochloric acid. The precipitate was centrifuged and 
washed, the resulting solution contained about 4 mM dXI P, about 9 mM 
FDP and about 25 mM dR5P. 

Example 4 

Removal of excess FDP and dXIP by degradation via DHAP 

576mg(1 .05mmol)trisodium-fructose-1 ,6-disphosphate-octahydrate were 
dissolved in 8 ml water, and the pH was adjusted at 8.1 by means of 
sodium hydroxide solution. 75 U PPA and 27 U rabbit muscle aldolase 
(RAMA) were added, and water was added to give 10 ml. 570 jj\ (440 mg, 
1 0 mmol) acetaldehyde were added. The reaction was stored at 4^C. After 
100 h the solution contained about 110 mM dXIP, about 5 mM FDP and 
about 85 mM dR5P (about 870 //mol). The reaction was stopped by adding 
hydrochloric acid until a pH of 2 was reached- After adding sodium 
hydroxide solution to give a pH of 5.5 the solution was stored. 

For removing dXI P the acetaldehyde was evaporated and the solution was 
diluted with water to reach 30 ml. It was mixed with 3 ml 2.65 M sodium 
formate solution (8 mmol), and sodium hydroxide solution was added until 
a pH of 7.4 was reached. 23 U formate dehydrogenase (FDH), 6 mg NADH, 
1 6 U RAMA and 20 U glycerolphosphate dehydrogenase (GDH) were 
added.. 
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After 24 h at room temperature the concentrations of dXIP and FDP are 
below 3 mM, the loss of dR5P is less than 10%. 

Example 5 

Preparation of dR5P via G3P 

1.1 g (2,0 mmol) trisodium-fructose-1 ,6-disphosphate-octahydrate were 
dissolved in 8 ml water. 1 .58 mol of a 2.65 M sodium formate solution (4.2 
mmol) and 14.2 mg NADH were added. A pH of 7.0 was adjusted by 
means of NaOH. After addition of 36 U RAMA, 50 U triosephosphate 
isomerase (TIM), 34 U GDH and 35 U FDH water was added to give 12 ml. 

After incubation of 40 h at room temperature the FDP content was below 

3 mM. The enzymes were denatured by acidification with hydrochloric acid 
to reach a pH of 2. Subsequently, the pH of the solution was adjusted at 

4 and the solids were centrifuged and filtered off, respectively. Through 
dilution during purification a total volume of 25 ml was reached which 
contained about 160 mM of glycerol-3-phosphate (G3P). 

4 ml of this solution (about 640 //mol G3P) were adjusted at a pH of 7.8 by 
means of sodium hydroxide solution. 7.8 kU catalase, 500 U TIM and 13 
U glycerol 3-phosphate oxidase are added. The mixture was stirred very 
slowly in an open flask. After 30 min 18 U PPA were added. Acetaldehyde 
was added in portions of 30 //l(23.5 mg, 530 //mol) after 30, 60, 120, 180 
and 240 min. After 24 h another 15 U PPA, 2.5 kU TIM and 100 //I (78 mg, 
1.8 mmol) acetaldehyde were added. After 30 h the batch is sealed after 
addition of another 100 //! acetaldehyde. After a total of 45 h a 
concentration of about 60 mM. dR5P was achieved and the reaction is 
completed. For preparing 2'-deoxyadenosine (e.g. Example 7) excess 
acetaldehyde must be distilled off. 
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Example 6 

Preparation of a dR5P solution containing small amounts of dXIP or FDP 

A solution of 60 mmol/l FDP and 120 mmol/l acetaldehyde having pH 7.4 
was kept at a temperature of 15^C. 5 ml thereof were mixed with 4 U 
aldolase II, 2 U TIM and 40 U PPA and kept at 15°C. After 4, 8.5, 16.5 
and 24 h 12 U PPA and 100 /yl of a 34 vol.-% solution of acetaldehyde in 
water (26.4 mg, 600 ^mol) were added each. After 40 h the solution was 
allowed to reach room temperature. After 90 h the reaction solution had 
reached concentrations of about 3 mM FDP, about 4 mM dXI P and at least 
70 mM dR5P. For stopping the reaction and removing acetaldehyde about 
20% of the volume were distilled off. 

Example 7 

Preparation of deoxyadenosine (dA) from dR5P by means of barium acetate 

dR5P was used in the form of a solution prepared according to Examples 3r 
6. For instance, 10 ml of a solution of Example 6 diluted to have 70 mM 
dR5P (700 jjmol dR5P) were mixed with 40 mg (300 //mol) adenine, 41 //g 
(50 nmol) tetracyclohexylammonium-glucose-1 ,6-disphosphate, 396 ^/g (2 
//mol) manganese-ll-acetate-tetrahydrate, 10 U pentosephosphate mutase 
(PPM) and 30 U purine-nucleoside phosphorylase (PNP). After 3 h another 
27 mg {200 //mol) adenine and 26 mg (100 ^mol) barium acetate were 
added. 

A further amount of 26 mg barium acetate was added after 4 h, one of 40 
mg adenine after 7 h. After 10 h the reaction was completed. The solution 
had a concentration of 45 mM dA. 
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Example 8 

Preparation of deoxyadenosine (dA) from dR5P by means of sucrose 
phosphorylase 

10 ml of a solution of Example 6 diluted to 55 mM dR5P (550A/mol dR5P) 
were mixed with 81 mg (600 /ymol) adenine, 41 //g (50 nmol) 
tetracyclohexylammoniym-glucose-1 ,6-disphosphate, 396 jjg (2 pmo\) 
manganese-ll-acetate-tetrahydrate, 10 U pentosephosphate mutase (PPM) 
1 5 U purine nucleoside phosphorylase (PNP), 25 U sucrose phosphorylase 
and 340 mg (1 mmoi) cane sugar. 

After 3 h at room temperature the reaction was completed. The solution 
had a concentration of about 50 mM dA. 

Example 9 

Preparation of deoxyadenosine (dA) from dR5P by means of maltose 
phosphorylase 

10 ml of a solution of dR5P diluted to 55 mM were mixed at pH 7.0 with 
81 mg (600 /jmo\) adenine, 41 jjg (50 nmoles) glucose 1,6-diphosphate, 
396 fjg (2 //moles) manganese ll-acetate tetrahydrate, 5 units pentose 
phosphate mutase (PPM), 1 0 units purine nucleoside phosphorylase, (PNP), 
20 units maltose phosphorylase and 1080 mg (3 mmoles) maltose. 

After 12h at room temperature the reaction was completed. The solution 
had a concentration of 49 mM dA. 
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Example 10 

Preparation of deoxycytosine (dC) from dR5P by means of sucrose 
phosphorylase 

20 ml of a solution of dR5P diluted to 70 mM were mixed at pH 7.0 with 
5.4 mg adenine (0.04 mmoles), 1 55 mg cytosine (1 .4 mmoles), 82//g (100 
nmoles) glucose 1 ,6-diphosphate, 792//g {4/ymoles) manganese ll-acetate- 
tetrahydrate, 20 units PPM, 30 units PNP, 50 units 2-deoxyribosyl 
transferase (NdT), 50 units sucrose phosphorylase and 2.05 g (6 mmoles) 
sucrose. 

After 18h at 30°C the solution had a concentration of 62 mM dC. 
Example 11 

Preparation of deoxyguanosine (dG) from dR5P by means of sucrose 
phosphorylase 

20 ml of a solution of dR5P diluted to 70 mM were mixed at pH 7.0 with 
91 mg guanine (0.6 mmoles), 82 //g (100 nmoles) glucose 1,6- 
diphosphate, 792//g (4;/moles) manganese ll-acetate-tetrahydrate, 20 units 
PPM, 10 units PNP, 20 units sucrose phosphorylase and 2.05 g (6 mmoles) 
sucrose. 

After 18h at 37**C the dG formed corresponds to 0.5 mmoles. 
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Example 12 

Two step procedure of dA synthesis 

In the first step dR5P was prepared by adding FDP-Aldolase II (Aldll) from 
E. coli, pentosephosphate aldolase (PPA) from E. coli and triosephosphate 
isomerase (TIM) from E. coli to fructose-1 -6-bisphosphate (FDP) and 
acetaldehyde (AcAld) essentially according to Ex. 6. FDPtrisodium salt was 
mixed in a final concentration of 75 mM with Ac Aid (100 mM final 
concentration). The pH was adjusted to 7,4 by addition of sodium 
hydroxide. The reaction was started by adding PPA (12 kU/l), Aid 11 (0,3 
kU/l) and TIM (2,5 kU/l). At 4 h 1 1 7 mM AcAld, at 7 h 11 7 mm AcAld, PPA 
6 kU/l, TIM 2,5 kU/l and at 1 2 h 1 1 7 mM AcAid were added. The reaction 
was run at 21 "C. Conversion was monitored by enzymatical assay using 
step by step glycerol'3-phosphate dehydrogenase (GDH), rabbit muscle 
aldolase (RAMA), trisosephosphate isomerase (TIM), pentosephosphate 
aldolase (PPA) in the presence of NADH (0,26 mM in 300 mM triethanol 
amine buffer pH 7.6). Conversion is shown in Fig. 1. 

After yielding approx. 95 mM dR5P the enzymes were deactivated by 
heating to 65 *C for 10 min. and excess of AcAld was removed by 
evaporation. In the second step dR5P in a final concentration of 64 mM was 
converted to deoxyadenosine (dA) by adding adenine (A, final concentration 
58 mM) in the presence of 300 //M MnClj, 5//M Glucose-1 .6-bisphosphate, 
pentosephosphate mutase from E. coli (PPM, 2 kU/l), purine nucleoside 
phosphorylase from E. coli (PNP, 1 kU/I). The synthesis was run at 20 *C, 
pH 7.4. In one experiment 200 mM sucrose and 0.6 kU/l sucrose 
phosphorylase (SP) from Leuconostoc mes, were added at t = 2 h (see 
arrow in Fig. 2, rhombus, solid line), in a second experiment addition of SP 
was omitted (squares, dotted line). The conversion was monitored by RP- 
HPLC (column Hypersil ODS 5 //m, 250 x 4,6 mm; eluent: 30 mM 
potassium phosphate, 5 mM tetrabutyl ammoniumhydrogensulfate pH 6.0/ 
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1 % acetonitrile, flow rate: 1 ml/min, column temp.: 35 'C, det.: UV at 260 
nm) and is shown in Fig. 2. 

Example 13 

dR5P was prepared by adding FDP-Aldolase II (Aldll) from E. coll, 
pentosephosphate aldolase (PPA) from E. coll and trisosephosphate 
isomerase (TIM) from E. coli to fructose- 1 .6-bisphosphate (FDP) and 
acetaldehyde (AcAld) essentially according to Ex. 6. Excess of AcAld was 
removed by evaporation. dR5P in a final concentration of 60 mM was 
converted to deoxyadenosine (d A) by adding adenine {A, final concentration 
58 mM) in the presence of 300 //M MnClj, 5 //M Glucose-1 .6-bisphosphate, 
pentosephosphate mutase from E. coli (PPM, 1,5 kU/l), purine nucleoside 
phosphorylase from E. coli (PNP, 1 kU/l). The synthesis was run at 20*C, 
pH 7.4. After 24 h sucrose in a final concentration of 200 mM and sucrose 
phosphorlyase from Leuconsotoc mes. (1 kU/l) were added. Conversion was 
monitored by RP-HPLC (dA, A, see ex. 12)) resp. enzymatical assay (dR5P, 
using step by step glycerol-3-phosphate dehydrogenase (GDH), rabbit 
muscle aldolase (RAMA), trisosephosphate isomerase (TIM), 
pentosephosphate aldolase (PPA) in the presence of NADH (0,26 mM in 
300 mM Triethanol amine buffer pH 7.6)) and phosphomolybdate 
complexing of inorg. phosphate (Sigma, Proc. No. 360-UV). This is shown 
in Fig. 3. 

Example 14 

dR5P was essentially prepared according according to Ex. 6. dR5P in a final 
concentration of 80 mM was then converted to deoxy-6-aminoguanosine 
(dG-NH^) by adding 2,6-Diaminopurine (DAP, final concentration 77 mM) in 
the presence of 200 mM sucrose, 300 pM MnClj, 5 pM Glucose-1.6- 
bisphosphate, pentosephosphate mutase from E. coli (PPM, 2,5 kU/l), 
purine nucleoside phosphorylase from E. coli (PNP, 1 kU/l), sucrose 
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phosphorylase from Leucoonostoc mes. (SP, 1 ,5 kU/1) . The synthesis was 
run at 20' C pH 7.4. After 2,5h, 5 h and 20,5 h additional amounts of 
enzymes were added: 2,5 h PPM (2,5 kU/l), PNP (1 kU/1, SP (1,5 kU/l), 5 
h PPM (2,5 kU/l), SP (1,5 kU/1), 20,5 h: PPM (2,5 kU/l), SP (1,5 kU/l). The 
conversion was monitored by RP-HPLC (column Hypersil ODS 5//m, 250 x 
4,6 mm; eluent: 30 mM potassium phosphate, 5 mM tetrabutyl 
ammoniumhydrogensulfate pH 6.0/ 1 % acetonitrile, flow rate: 1 ml/min, 
column temp.: 35 "C, det.: UV at 216 nm) and is shown in Fig. 4. 
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New Claims 

1 . A method for the in vitro enzymatic synthesis of deoxyribonucleosides 
comprising reacting deoxyribose 1 -phosphate (dRI P) and a nucleobase, 
wherein a deoxyribonucleoside and inorganic phosphate are formed 
and wherein the inorganic phosphate is removed. 

2. The method of claim 1, wherein the reaction is catalyzed by a 
thymidine phosphorylase (TP, EC 2.4.2.4) or a purine nucleoside 
phosphorylase {PNP, EC 2.4.2.1). 

3. The method of any one of the previous claims, wherein the nucleobase 
is selected from the group consisting of thymine, uracil, adenine, 
guanine and hypoxanthine and analogs thereof, e.g. 2-thio-uracil, 6- 
aza-uracil, 5-carboxy-2-thio-uracil, 6-aza-thymine, 6-aza-2-thio-thymine 
and 2,6-diamino-purine. 

4. The method of any one of the previous claims, wherein the removal of 
the inorganic phosphate is effected by (i) conversion to inorganic 
pyrophosphate, (ii) precipitation, (iii) complexation and/or (iv) substrate 
phosphorylation. 

5. The method of claim 4, wherein the inorganic phosphate is converted 
to pyrophosphate by a phosphate transfer from fructose-diphosphate 
(FDP) under formation of fructose-6-phosphate (F6P). 

6. The method of claim 5, wherein the phosphate transfer is catalyzed by 
a PPi-dependent phosphofructokinase (PFK-RPi, EC 2.7.1.90). 
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The method of claim 5 or 6, wherein the inorganic pyrophosphate is 
removed by precipitation. 



8. The method of claim 4, wherein the inorganic phosphate is transferred 
to a disaccharide, particularly sucrose or maltose under formation of a 
monosaccharide and a phosphorylated monosaccharide. 

9. The method of claim 8, wherein the phosphate transfer is catalyzed by 
a sucrose phosphorylase (EC 2.4.1 .7) or a maltose phosphorylase (EC 
2.4.1.8). 

1 0. The method of claim 9, wherein the phosphorylated monosaccharide is 
further reacted. 

1 1 . The method of any one of the previous claims, wherein the 
deoxyribose-1 -phosphate is generated from deoxyribose 5-phosphate 
(dR5P). 

12. The method of claim 11, wherein the reaction is catalyzed by a 
deoxyribomutase (EC 2.7.5.1) or a phosphopentose mutase (PPM, EC 
5.4.2.7). 

13. The method of claim 11 or 12, wherein the deoxyribose-5-phosphate 
is generated by a condensation of glyceraldehyde 3-phosphate (GAP) 
with acetaldehyde. 

14. The method of claim 13, wherein the reaction is catalyzed by a 
phosphopentose aldolase (PPA, EC 4.1.2.4). 

15. The method of claim 13 or 14, wherein the glyceraldehyde 3- 
phosphate is generated from fructose 1 ,6-diphosphate. 
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dihydroxyacetone (DHA) and/or glycerolphosphate. 

3. The method of claim 15, wherein the glyceraldehyde 3-phosphate is 
generated from fructose 1 ,6-diphosphate in a reaction catalyzed by an 
FDP-aldolase {EC 4.1.2,13) selected from FDP-aldolases I and FDP- 
aldolases IL 

\ The method of claim 15, wherein the glyceraldehyde 3-phosphate is 
generated from dihydroxyacetone and ATP under formation of 
dihydroxyacetone phosphate (DHAP) and ADP and subsequent 
isomerization of DHAP to GAP in a reaction catalyzed by a 
glycerokinase (GK, EC 2.7.1-30) and a triose phosphate isomerase 
(TIM, EC 5.3.1.1). 

. The method of claim 15, wherein the glyceraldehyde 3-phosphate is 
generated from glycerol phosphate (GP) and O2 under formation of 
dihydroxyacetone phosphate (DHAP) and HjOj and subsequent 
isomerization of DHAP to GAP in a reaction catalyzed by a 
glycerophosphate oxidase (GPO, EC 1 .1 .3.21) and a triose phosphate 
isomerase (TIM, EC 5.3.1.1). 

The method of claim 1 1 or 12, wherein the deoxyribose 5-phosphate 
is generated by a phosphorylation of deoxyribose. 

The method of claim 19, wherein the reaction is catalyzed by a 
deoxyribokinase (dRK, EC 2.7.1.15). 

The method of claim 20, wherein a dRK obtainable from Salmonella 
typhi is used which is encoded by (a) the nucleotide sequence shown 
in SEQ ID NO. 11 or its complementary sequence, (b) a nucleotide 
sequence corresponding to the sequence of (a) in the scope of the 
degeneracy of the genetic code or (c) ^a nucleotide sequence 
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hybridizing under stringent conditions to the sequence of (a) and/or (b). 



22. The method of any one of the previous claims, wherein a 
deoxyribonucleoside containing a first nucleobase is further reacted 
with a second nucleobase under formation of a deoxyribonucleoside 
containing the second nucleobase. 



23- The method of claim 22, wherein said second nucleobase is selected 
from cytidine and analogs thereof, e.g. 5-aza-cytidine, 2,6-dichloro- 
purine, 6-chloro-guanine, 6-chloro-purine, 6-aza-thymine and 5-f!uoro- 
uracil. 



24. The method of claim 23, wherein the reaction is catalyzed by a 
nucleoside 2-deoxyribosyl transferase (NdT, EC 2.4.2.6). 

25. The method of claim 24, wherein an NdT obtainable from Lactobacillus 
leichmannii is used which is encoded by (a) the nucleotide sequence 
shown in SEQ ID NO. 13 or its complementary sequence, (b) a 
nucleotide sequence corresponding to the sequence of (a) in the scope 
of the degeneracy of the genetic code or (c) a nucleotide sequence 
hybridizing under stringent conditions to the sequence of (a) and/or (b). 



26. A method for the in vitro enzymatic synthesis of deoxyribonucleosides 
comprising the steps of: 

(i) condensing glyceraldehyde 3-phosphate (GAP) with 
acetaldehyde to deoxyribose 5-phosphate (dR5P), 

(ii) isomerizing deoxyribose 5-phosphate to deoxyribose 1- 
phosphate (dRIP) and 

(iii) reacting deoxyribose 1 -phosphate and nucleobase, 
wherein a deoxyribonucleoside and inorganic phosphate 
are formed. 



AMENDEb SHEET 





- 5 - 



27. The method of claim 26, wherein the reaction is carried out without 
isolating intermediate products. 

28. The method of claim 26 or 27, wherein the glyceraldehyde 3- 
phosphate (GAP) is generated from fructose 1 ,6-diphosphate (FDP), 
dihydroxy-acetone (DHA) and/or glycerolphosphate (GP). 

29. The method of claims 26 to 28, wherein before step (ji) excess 
acetaldehyde is removed. 

30. The method of claims 26 to 29, wherein before step (ii) excess starting 
materials and/or by-products, particularly fructose 1 ,6-diphosphate 
and/or deoxyxylulose 1 -phosphate (dXIP) are removed- 



31 . The method of claims 26 to 29, wherein the reaction is carried out in 
a manner that before step (ii) no substantial amounts of starting 
materials and/or by-products, particularly fructose 1 ,6-diphosphate 
and/or deoxyxylulose 1 -phosphate are present. 

32. A method for the in vitro enzymatic synthesis of deoxyribonucleosides 
comprising the steps of: 

(i) phosphorylating deoxyribose to deoxyribose 5-phosphate, 

(ii) isomerizing deoxyribose 5-phosphate to deoxyribose 1- 



(iii) 



phosphate and 

reacting deoxyribose 1 -phosphate and nucleobase, 
wherein a deoxyribonucleoside an inorganic phosphate 



are formed. 



33. The method of claim 32, wherein the reaction is carried out without 
isolating intermediate products. 
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tjr 34. The method of claims 26 to 33, wherein the inorganic phosphate is 
removed. 

35. The method of any one of the previous claims comprising further 
reacting said deoxyribonucleoside. 

36. The method of claim 35, wherein said further reacting comprises the 
synthesis of deoxyribonucleoside mono-, di- or triphosphates, of H- 
phosphonates or of phosphoramidites. 



37. The use of an isolated nucleic acid molecule encoding a nucleoside 2- 
deoxyribosyl transferase {NdT, EC 2.4.2,6) for the preparation of an 
enzyme for an in vitro method for the enzymatic synthesis of 
deoxyribonucleosides, wherein a deoxyribonucleoside containing a first 
nucleobase is further reacted with a second nucleobase un'der 
formation of a deoxyribonucleoside containing the second nucleobase, 
wherein said nucleic acid molecule comprises (a) the nucleotide 
sequence shown in SEQ ID NO. 13 or its complementary sequence, (b) 
a nucleotide sequence corresponding to the sequence of (a) in the 
scope of the degeneracy of the genetic code or (c) a nucleotide 
sequence hybridizing under stringent conditions to the sequence of (a) 
and/or (b). 

38. The use of claim 37, wherein the second nucleobase is selected from 
cytidine and analogs thereof, e.g. 6-methyl purine, 2-amino-6- 
methylmercaptopurine, 6-dimethylaminopurine, 5-azacytidine, 2,6- 
dichloropurine, 6-chloroguanine, 6-chloropurine, 6-azathymine, 5- 
fluorouracil, ethyl-4-amino-5-imidazole carboxylate, imidazole-4- 
carboxamide and 1 ,2,4-triazole-3-carboxamide, 

39. The use of claim 37 or 38, wherein the first nucleobase is selected 
from adenine, guanine, thymine, uracil and hypoxanthine. 
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40. The use of any one of claims 37-39, wherein the nucleic acid molecule 
is contained on a recombinant vector in operative linkage with an 
expression control sequence. 

41. The use of any one of claims 37-40, wherein the nucleic acid is 
contained in a recombinant cell. 

42. Use of an isolated polypeptide having NdT activity for the preparation 
of nucleosides according to claim 23, 

43. Use of an isolated nucleic acid molecule encoding a deoxyribokinase 
(dRK, EC 2.7.1.5) for the preparation of an enzyme for an in vitro 
method for the enzymatic synthesis of deoxyribonucleosides 
comprising the step of phosphorylating deoxyribose to deoxyribose 5- 
phosphate, wherein said nucleic acid molecule comprises (a) 'the 
nucleotide sequence shown in SEQ ID NO. 11 or its complementary 
sequence, (b) a nucleotide sequence corresponding to the sequence of 
(a) in the scope of the degeneracy of the genetic code or (c) a 
nucleotide sequence hybridizing under stringent conditions to the 
sequence of (a) and/or (b). 

44. Use of an isolated polypeptide having dRK activity for an in vitro 
method for the enzymatic synthesis of deoxyribonucleosides 
comprising the step of phosphorylating deoxyribose to deoxyribose 5- 
phosphate. 

45. Recombinant bacteria strains deposited at CNCM under accession 
numbers 1-2186, 1-2187, 1-2188, 1-2189, 1-2190 and 1-2191. 
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<110> Tischer, Wilhelm 

Ihlenfeldt, Hans-Georg 
Barzu, Octavian 
Sakamoto, Hiroshi 
Pistotnik, Elisabeth 
Marliere, Philippe 
Pochet, Sylvie 

<120> Enzymatic synthesis of deoxyribonucleosides 

<130> 20373PWO Deoxyribonucleosides 

<14 0> To be assigned 
<141> 2002-02-15 

<150> EP99116425.2 
<151> 1999-08-20 

<160> 20 

<170> Patentin Ver. 2.1 

<210> 1 
<211> 1323 
<212> DMA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . (1320) 
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<400> 1 

ttg ttt etc gca caa gaa att att cgt aaa aaa cgt gat ggt cat gcg 4 8 
Leu Phe Leu Ala Gin Glu lie lie Arg Lys Lys Arg Asp Gly His Ala 
15 10 15 



ctg age gat gaa gaa att cgt ttc ttt ate aac ggt att ege gac aac 96 
Leu Ser Asp Glu Glu lie Arg Phe Phe lie Asn Gly lie Arg Asp Asn 
20 25 30 



act ate tec gaa ggg cag att gcc gee etc gcg atg acc att ttc ttc 144 
Thr lie Ser Glu Gly Gin lie Ala Ala Leu Ala Met Thr lie Phe Phe 
35 40 45 



cac gat atg aca atg cct gag cgt gtc teg ctg acc atg gcg atg cga 192 
His Asp Met Thr Met Pro Glu Arg Val Ser Leu Thr Met Ala Met Arg 
50 55 60 

gat tea gga ace gtt etc gac tgg aaa age ctg cat ctg aat ggc ccg 240 
Asp Ser Gly Thr Val Leu Asp Trp Lys Ser Leu His Leu Asn Gly Pro 
65 70 75 80 



att gtt gat aaa cac tec ace ggt ggc gtc ggc gat gtg act teg ctg 2 88 
lie Val Asp Lys His Ser Thr Gly Gly Val Gly Asp Val Thr Ser Leu 
85 90 95 



atg ttg ggg ccg atg gtc gca gee tgc ggc ggc tat att ccg atg ate 336 
Met Leu Gly Pro Met Val Ala Ala Cys Gly Gly Tyr lie Pro Met lie 
100 105 110 



tct ggt ege ggc etc ggt cat act ggc ggt acg etc gac aaa ctg gaa 3 84 
Ser Gly Arg Gly Leu Gly His Thr Gly Gly Thr Leu Asp Lys Leu Glu 
115 120 125 
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432 



480 



528 



576 



tec ate cct ggc ttc gac att ttc ccg gat gac aac cgt ttc cgc gaa 
Ser He Pro Gly Phe Asp He Phe Pro Asp Asp Asn Arg Phe Arg Glu 
130 135 140 

att att aaa gac gtc ggc gtg gcg att ate ggt cag ace agt tea ctg 
He He Lys Asp Val Gly Val Ala He He Gly Gin Thr Ser Ser Leu 

150 155 160 

get ccg get gat aaa cgt ttc tac gcg acc cgt gat att acc gca acc 
Ala Pro Ala Asp Lys Arg Phe Tyr Ala Thr Arg Asp He Thr Ala Thr 
165 170 175 

gtg gac tec ate ccg ctg ate ace gee tet att ctg gcg aag aaa ett 
Val Asp Ser He Pro Leu He Thr Ala Ser He Leu Ala Lys Lys Leu 
180 185 190 



gcg gaa ggt ctg gac gcg ctg gtg atg gae gtg aaa gtg ggt age ggc 624 
Ala Glu Gly Leu Asp Ala Leu Val Met Asp Val Lys Val Gly Ser Gly 
1^5 200 205 

gcg ttt atg ccg acc tac gaa etc tet gaa gcc ctt gcc gaa gcg att 672 
Ala Phe Met Pro Thr Tyr Glu Leu Ser Glu Ala Leu Ala Glu Ala He 
210 215 220 

gtt ggc gtg get aac ggc get ggc gtg cgc ace ace gcg ctg etc ace 72 0 
Val Gly Val Ala Asn Gly Ala Gly Val Arg Thr Thr Ala Leu Leu Thr 

230 235 240 

gac atg aat cag gta ctg gcc tec agt gca ggt aac gcg gtt gaa gtt 768 
Asp Met Asn Gin Val Leu Ala Ser Ser Ala Gly Asn Ala Val Glu Val 
245 250 255 
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cgt gaa gcg gtg cag ttc ctg acg ggt gaa tat cgt aac ccg cgt ctg 816 
Arg Glu Ala Val Gin Phe Leu Thr Gly Glu Tyr Arg Asn Pro Arg Leu 
260 265 270 

ttt gat gtc acg atg gcg ctg tgc gtg gag atg ctg ate tec ggc aaa 864 
Phe Asp Val Thr Met Ala Leu Cys Val Glu Met Leu He Ser Gly Lys 
275 280 285 

ctg gcg aaa gat gac gcc gaa gcg cgc gcg aaa ttg cag gcg gtg ctg 912 
Leu Ala Lys Asp Asp Ala Glu Ala Arg Ala Lys Leu Gin Ala Val Leu 
290 295 300 

gac aac ggt aaa gcg gca gaa gtc ttt ggt cgt atg gta gcg gca caa 960 
Asp Asn Gly Lys Ala Ala Glu Val Phe Gly Arg Met Val Ala Ala Gin 
305 310 315 320 

aaa ggc ccg acc gac ttc gtt gag aac tac gcg aag tat ctg ccg aca 1008 
Lys Gly Pro Thr Asp Phe Val Glu Asn Tyr Ala Lys Tyr Leu Pro Thr 
325 330 335 

gcg atg ctg acg aaa gca gtc tat get gat acc gaa ggt ttt gtc agt 1056 
Ala Met Leu Thr Lys Ala Val Tyr Ala Asp Thr Glu Gly Phe Val Ser 
340 345 350 

gaa atg gat acc cgc gcg ctg ggg atg gca gtg gtt gca atg ggc ggc 1104 
Glu Met Asp Thr Arg Ala Leu Gly Met Ala Val Val Ala Met Gly Gly 
355 360 365 

gga cgc cgt cag gca tct gac acc ate gat tac age gtc ggc ttt act 1152 
Gly Arg Arg Gin Ala Ser Asp Thr He Asp Tyr Ser Val Gly Phe Thr 
370 375 380 
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gat atg gcg cgt ctg ggc gac cag gta gac ggt cag cgt ccg ctg gcg 1200 
Asp Met Ala Arg Leu Gly Asp Gin Val Asp Gly Gin Arg Pro Leu Ala 
385 390 395 400 

gtt ate cac gcg aaa gac gaa aac aac tgg cag gaa gcg gcg aaa gcg 1248 
Val He His Ala Lys Asp Glu Asn Asn Trp Gin Glu Ala Ala Lys Ala 
405 410 415 

gtg aaa gcg gca att aaa ctt gcc gat aaa gca ccg gaa age aca cca 1296 
Val Lys Ala Ala He Lys Leu Ala Asp Lys Ala Pro Glu Ser Thr Pro 
420 425 430 

act gtc tat cgc cgt ate age gaa taa 1323 
Thr Val Tyr Arg Arg He Ser Glu 
435 440 



<210> 2 
<211> 440 
<212> PRT 

<213> Escherichia coli 
<400> 2 

Leu Phe Leu Ala Gin Glu He He Arg Lys Lys Arg Asp Gly His Ala 
15 10 15 

Leu Ser Asp Glu Glu He Arg Phe Phe He Asn Gly He Arg Asp Asn 
20 25 30 

Thr He Ser Glu Gly Gin He Ala Ala Leu Ala Met Thr He Phe Phe 
35 40 45 
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His Asp Met Thr Met Pro Glu Arg Val Ser Leu Thr Met Ala Met Arg 
50 55 60 

Asp Ser Gly Thr Val Leu Asp Trp Lys Ser Leu His Leu Asn Gly Pro 
65 70 75 80 

lie Val Asp Lys His Ser Thr Gly Gly Val Gly Asp Val Thr Ser Leu 
85 90 95 

Met Leu Gly Pro Met Val Ala Ala Cys Gly Gly Tyr lie Pro Met lie 
100 105 110 

Ser Gly Arg Gly Leu Gly His Thr Gly Gly Thr Leu Asp Lys Leu Glu 
115 120 125 

Ser lie Pro Gly Phe Asp He Phe Pro Asp Asp Asn Arg Phe Arg Glu 
130 135 140 

lie He Lys Asp Val Gly Val Ala He He Gly Gin Thr Ser Ser Leu 
145 150 155 160 

Ala Pro Ala Asp Lys Arg Phe Tyr Ala Thr Arg Asp He Thr Ala Thr 
165 170 175 

Val Asp Ser He Pro Leu He Thr Ala Ser He Leu Ala Lys Lys Leu 
180 185 190 

Ala Glu Gly Leu Asp Ala Leu Val Met Asp Val Lys Val Gly Ser Gly 
195 200 205 

Ala Phe Met Pro Thr Tyr Glu Leu Ser Glu Ala Leu Ala Glu Ala He 
210 215 220 
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Val Gly Val Ala 
225 

Asp Met Asn Gin 

Arg Glu Ala Val 
260 

Phe Asp Val Thr 
275 

Leu Ala Lys Asp 
290 

Asp Asn Gly Lys 
305 

Lys Gly Pro Thr 

Ala Met Leu Thr 
340 

Glu Met Asp Thr 
355 

Gly Arg Arg Gin 
370 

Asp Met Ala Arg 
385 



Asn Gly Ala Gly 
230 

Val Leu Ala Ser 
245 

Gin Phe Leu Thr 



Met Ala Leu Cys 
280 

Asp Ala Glu Ala 
295 

Ala Ala Glu Val 
310 

Asp Phe Val Glu 
325 

Lys Ala Val Tyr 

Arg Ala Leu Gly 
360 

Ala Ser Asp Thr 
375 

Leu Gly Asp Gin 
390 
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Val Arg Thr Thr 
235 

Ser Ala Gly Asn 
250 

Gly Glu Tyr Arg 
265 

Val Glu Met Leu 

Arg Ala Lys Leu 
300 

Phe Gly Arg Met 
315 

Asn Tyr Ala Lys 
330 

Ala Asp Thr Glu 
345 

Met Ala Val Val 

lie Asp Tyr Ser 
380 

Val Asp Gly Gin 
395 




Ala Leu Leu Thr 
240 

Ala Val Glu Val 
255 

Asn Pro Arg Leu 
270 

lie Ser Gly Lys 
285 

Gin Ala Val Leu 

Val Ala Ala Gin 
320 

Tyr Leu Pro Thr 
335 

Gly Phe Val Ser 
350 

Ala Met Gly Gly 
365 

Val Gly Phe Thr 

Arg Pro Leu Ala 
400 
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Val He His Ala Lys Asp Glu Asn Asn Trp Gin Glu Ala Ala Lys Ala 
405 410 415 

Val Lys Ala Ala He Lys Leu Ala Asp Lys Ala Pro Glu Ser Thr Pro 
420 425 430 

Thr Val Tyr Arg Arg lie Ser Glu 
435 440 



<210> 3 
<211> 720 
<212> DNA 

<213> Escherichia coli 



<220> 

<221> CDS 

<222> (1) . . (717) 



<400> 3 

atg get acc cca cac att aat gca 
Met Ala Thr Pro His He Asn Ala 
1 5 

gtt ttg atg cca ggc gac ccg ctg 
Val Leu Met Pro Gly Asp Pro Leu 
20 



gaa atg ggc gat ttc get gac gta 48 

Glu Met Gly Asp Phe Ala Asp Val 
10 15 

cgt gcg aag tat att get gaa act 96 

Arg Ala Lys Tyr lie Ala Glu Thr 
25 30 



ttc ctt gaa gat gee cgt gaa gtg aac aac gtt cgc ggt atg ctg ggc 144 
Phe Leu Glu Asp Ala Arg Glu Val Asn Asn Val Arg Gly Met Leu Gly 
35 40 45 
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ttc acc ggt act tac aaa ggc cgc aaa att tec gta atg ggt cac ggt 192 
Phe Thr Gly Thr Tyr Lys Gly Arg Lys He Ser Val Met Gly His Gly 
50 55 60 

atg ggt ate ccg tec tgc tee ate tac ace aaa gaa etg ate ace gat 240 
Met Gly He Pro Ser Cys Ser He Tyr Thr Lys Glu Leu He Thr Asp 
65 70 75 80 

ttc ggc gtg aag aaa att ate cgc gtg ggt tec tgt ggc gca gtt ctg 288 
Phe Gly Val Lys Lys He He Arg Val Gly Ser Cys Gly Ala Val Leu 
85 90 95 

ccg cac gta aaa ctg cgc gae gtc gtt ate ggt atg ggt gee tgc ace 336 
Pro His Val Lys Leu Arg Asp Val Val He Gly Met Gly Ala Cys Thr 
100 105 110 

gat tec aaa gtt aac cgc ate egt ttt aaa gae eat gae ttt gee get 384 
Asp Ser Lys Val Asn Arg He Arg Phe Lys Asp His Asp Phe Ala Ala 
115 120 125 

ate get gae ttc gae atg gtg egt aac gca gta gat gca get aaa gca 432 
He Ala Asp Phe Asp Met Val Arg Asn Ala Val Asp Ala Ala Lys Ala 
130 135 140 

ctg ggt att gat get cgc gtg ggt aac etg ttc tec get gae etg ttc 480 
Leu Gly He Asp Ala Arg Val Gly Asn Leu Phe Ser Ala Asp Leu Phe 

150 155 160 

tac tet ccg gae ggc gaa atg ttc gae gtg atg gaa aaa tac ggc att 528 
Tyr Ser Pro Asp Gly Glu Met Phe Asp Val Met Glu Lys Tyr Gly He 
165 170 175 
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etc ggc gtg gaa atg gaa gcg get ggt ate 
Leu Gly Val Glu Met Glu Ala Ala Gly lie 
180 185 

ttt ggc gcg aaa gee ctg acc ate tgc ace 
Phe Gly Ala Lys Ala Leu Thr lie Cys Thr 
195 200 

act cac gag cag acc act gcc get gag cgt 
Thr His Glu Gin Thr Thr Ala Ala Glu Arg 
210 215 

atg ate aaa ate gca ctg gaa tec gtt ctg 
Met lie Lys lie Ala Leu Glu Ser Val Leu 
225 230 




of 49 

tac ggc gtc get gca gaa 576 
Tyr Gly Val Ala Ala Glu 
190 

gta tct gae cac ate cgc 624 
Val Ser Asp His lie Arg 
205 

cag act acc ttc aac gae 672 
Gin Thr Thr Phe Asn Asp 
220 

c^tg ggc gat aaa gag taa 72 0 

Leu Gly Asp Lys Glu 

235 



<210> 4 
<211> 239 
<212> PRT 

<213> Escherichia coli 
<400> 4 

Met Ala Thr Pro His lie Asn Ala Glu Met Gly Asp Phe Ala Asp Val 
15 10 15 

Val Leu Met Pro Gly Asp Pro Leu Arg Ala Lys Tyr lie Ala Glu Thr 
20 25 30 

Phe Leu Glu Asp Ala Arg Glu Val Asn Asn Val Arg Gly Met Leu Gly 
35 40 45 
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Phe Thr Gly Thr 
50 

Met Gly lie Pro 
65 

Phe Gly Val Lys 

Pro His Val Lys 
100 

Asp Ser Lys Val 
115 

He Ala Asp Phe 
130 

Leu Gly He Asp 
145 

Tyr Ser Pro Asp 

Leu Gly Val Glu 
180 

Phe Gly Ala Lys 
195 

Thr His Glu Gin 
210 



Tyr Lys Gly Arg 
55 

Ser Cys Ser He 
70 

Lys He He Arg 
85 

Leu Arg Asp Val 

Asn Arg He Arg 
120 

Asp Met Val Arg 
135 

Ala Arg Val Gly 
150 

Gly Glu Met Phe 
165 

Met Glu Ala Ala 

Ala Leu Thr He 
200 

Thr Thr Ala Ala 
215 
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Lys He Ser Val 
60 

Tyr Thr Lys Glu 
75 

Val Gly Ser Cys 
90 

Val He Gly Met 
105 

Phe Lys Asp His 

Asn Ala Val Asp 
140 

Asn Leu Phe Ser 
155 

Asp Val Met Glu 
170 

Gly He Tyr Gly 
185 

Cys Thr Val Ser 

Glu Arg Gin Thr 
220 




Met Gly His Gly 

Leu He Thr Asp 
80 

Gly Ala Val Leu 
95 

Gly Ala Cys Thr 
110 

Asp Phe Ala Ala 
125 

Ala Ala Lys Ala 

Ala Asp Leu Phe 
160 

Lys Tyr Gly He 
175 

Val Ala Ala Glu 
190 

Asp His He Arg 
205 

Thr Phe Asn Asp 
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Met lie Lys lie Ala Leu Glu Ser Val Leu Leu Gly Asp Lys Glu 
225 230 235 



<210> 5 
<211> 1224 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . (1221) 

<400> 5 

atg aaa cgt gca ttt att atg gtg ctg 
Met Lys Arg Ala Phe lie Met Val Leu 
1 5 

aca gaa gat gca gaa cgc ttt ggt gac 
Thr Glu Asp Ala Glu Arg Phe Gly Asp 
20 25 

cat ate gca gaa get tgt gcc aaa ggc 
His lie Ala Glu Ala Cys Ala Lys Gly 
35 40 

ggc ccg etc aat ctg cca aat ctg ace 
Gly Pro Leu Asn Leu Pro Asn Leu Thr 
50 55 



gac tea ttc ggc ate ggc get 4 8 
Asp Ser Phe Gly lie Gly Ala 
10 15 

gtc ggg get gac acc ctg ggt 96 
Val Gly Ala Asp Thr Leu Gly 
30 

gaa get gat aac ggt cgt aaa 144 
Glu Ala Asp Asn Gly Arg Lys 
45 

cgt ctg ggg ctg gcg aaa gca 192 
Arg Leu Gly Leu Ala Lys Ala 
60 
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cac gaa ggt tct acc ggt ttc att ccg gcg gga atg gac ggc aac get 240 

His Glu Gly Ser Thr Gly Phe lie Pro Ala Gly Met Asp Gly Asn Ala 

65 70 75 80 

gaa gtt ate ggc gcg tac gca tgg gcg cac gaa atg tea tec ggt aaa 2 88 

Glu Val lie Gly Ala Tyr Ala Trp Ala His Glu Met Ser Ser Gly Lys 
85 90 95 

gat acc ccg tct ggt cac tgg gaa att gcc ggt gtc ccg gtt ctg ttt 336 

Asp Thr Pro Ser Gly His Trp Glu lie Ala Gly Val Pro Val Leu Phe 
100 105 110 

gag tgg gga tat ttc tec gat cac gaa aac age ttc ccg caa gag ctg 3 84 

Glu Trp Gly Tyr Phe Ser Asp His Glu Asn Ser Phe Pro Gin Glu Leu 

115 120 125 

ctg gat aaa ctg gtc gaa cgc get aat ctg ccg ggt tac etc ggt aac 432 

Leu Asp Lys Leu Val Glu Arg Ala Asn Leu Pro Gly Tyr Leu Gly Asn 
130 135 140 

tgc cac tct tec ggt acg gtc att ctg gat caa ctg ggc gaa gag cac 480 

Cys His Ser Ser Gly Thr Val lie Leu Asp Gin Leu Gly Glu Glu His 

145 150 155 160 

atg aaa ace ggc aag ccg att ttc tat acc tec get gac tec gtg ttc 52 8 

Met Lys Thr Gly Lys Pro lie Phe Tyr Thr Ser Ala Asp Ser Val Phe 
165 170 175 

eag att gee tgc cat gaa gaa act ttc ggt ctg gat aaa etc tac gaa 576 

Gin lie Ala Cys His Glu Glu Thr Phe Gly Leu Asp Lys Leu Tyr Glu 
180 185 190 
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Page 14 

ctg tgc gaa ate gcc cgt gaa gag ctg acc 
Leu Cys Glu lie Ala Arg Glu Glu Leu Thr 
195 200 




of 49 

aac ggc ggc tac aat ate 624 
Asn Gly Gly Tyr Asn lie 
205 



ggt cgt gtt ate get egt eeg ttt ate ggc gae aaa gee ggt aac ttc 672 

Gly Arg Val lie Ala Arg Pro Phe lie Gly Asp Lys Ala Gly Asn Phe 

210 215 220 

cag cgt acc ggt aac cgt cac gac ctg get gtt gag ccg cca gca ccg 720 

Gin Arg Thr Gly Asn Arg His Asp Leu Ala Val Glu Pro Pro Ala Pro 

225 230 235 240 

acc gtg ctg cag aaa ctg gtt gat gaa aaa cac ggc cag gtg gtt tct 76 8 

Thr Val Leu Gin Lys Leu Val Asp Glu Lys His Gly Gin Val Val Ser 
245 250 255 

gte ggt aaa att gcg gac ate tac gcc aac tgc ggt ate acc aaa aaa 816 

Val Gly Lys lie Ala Asp lie Tyr Ala Asn Cys Gly lie Thr Lys Lys 

260 265 270 

gtg aaa gcg act ggc ctg gac gcg ctg ttt gac gcc acc ate aaa gag 864 

Val Lys Ala Thr Gly Leu Asp Ala Leu Phe Asp Ala Thr lie Lys Glu 
275 280 285 

atg aaa gaa gcg ggt gat aac ace ate gte ttc acc aac ttc gtt gac 912 

Met Lys Glu Ala Gly Asp Asn Thr lie Val Phe Thr Asn Phe Val Asp 

290 295 300 

ttc gac tct tec tgg ggc cac cgt ege gae gte gee ggt tat gee gcg 960 

Phe Asp Ser Ser Trp Gly His Arg Arg Asp Val Ala Gly Tyr Ala Ala 

305 310 315 320 
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ggt ctg gaa ctg ttc gac cgc cgt ctg ccg gag ctg atg tct ctg ctg 1008 
Gly Leu Glu Leu Phe Asp Arg Arg Leu Pro Glu Leu Met Ser Leu Leu 
325 330 335 

cgc gat gac gac ate ctg ate etc acc get gac eac ggt tge gat ccg 1056 
Arg Asp Asp Asp lie Leu lie Leu Thr Ala Asp His Gly Cys Asp Pro 
340 345 350 

acc tgg acc ggt act gac cac acg cgt gaa cac att ccg gta ctg gta 1104 
Thr Trp Thr Gly Thr Asp His Thr Arg Glu His lie Pro Val Leu Val 
355 360 365 

tat ggc ccg aaa gta aaa ceg ggc tea ctg ggt cat cgt gaa acc ttc 1152 
Tyr Gly Pro Lys Val Lys Pro Gly Ser Leu Gly His Arg Glu Thr Phe 
370 375 380 

gcg gat ate ggc cag act ctg gea aaa tat ttt ggt act tct gat atg 12 0 0 
Ala Asp lie Gly Gin Thr Leu Ala Lys Tyr Phe Gly Thr Ser Asp Met 
385 390 395 400 

gaa tat ggc aaa gcc atg ttc tga 1224 
Glu Tyr Gly Lys Ala Met Phe 
405 



<210> 6 
<211> 407 
<212> PRT 

<213> Escherichia coli 

<400> 6 

Met Lys Arg Ala Phe lie Met Val Leu Asp Ser Phe Gly lie Gly Ala 
15 10 15 
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Thr Glu Asp Ala Glu Arg Phe Gly Asp Val Gly Ala Asp Thr Leu Gly 
20 25 30 

His lie Ala Glu Ala Cys Ala Lys Gly Glu Ala Asp Asn Gly Arg Lys 
35 40 45 

Gly Pro Leu Asn Leu Pro Asn Leu Thr Arg Leu Gly Leu Ala Lys Ala 
50 55 60 

His Glu Gly Ser Thr Gly Phe lie Pro Ala Gly Met Asp Gly Asn Ala 
65 70 75 80 

Glu Val lie Gly Ala Tyr Ala Trp Ala His Glu Met Ser Ser Gly Lys 
85 90 95 

Asp Thr Pro Ser Gly His Trp Glu lie Ala Gly Val Pro Val Leu Phe 
100 105 110 

Glu Trp Gly Tyr Phe Ser Asp His Glu Asn Ser Phe Pro Gin Glu Leu 
115 120 125 

Leu Asp Lys Leu Val Glu Arg Ala Asn Leu Pro Gly Tyr Leu Gly Asn 
130 135 140 

Cys His Ser Ser Gly Thr Val lie Leu Asp Gin Leu Gly Glu Glu His 
145 150 155 160 

Met Lys Thr Gly Lys Pro lie Phe Tyr Thr Ser Ala Asp Ser Val Phe 
165 170 175 

Gin lie Ala Cys His Glu Glu Thr Phe Gly Leu Asp Lys Leu Tyr Glu 
180 185 190 
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Leu Cys Glu lie Ala Arg Glu Glu Leu Thr Asn Gly Gly Tyr Asn lie 
195 200 205 

Gly Arg Val lie Ala Arg Pro Phe lie Gly Asp Lys Ala Gly Asn Phe 
210 215 220 

Gin Arg Thr Gly Asn Arg His Asp Leu Ala Val Glu Pro Pro Ala Pro 
225 230 235 240 

Thr Val Leu Gin Lys Leu Val Asp Glu Lys His Gly Gin Val Val Ser 
245 250 255 

Val Gly Lys lie Ala Asp lie Tyr Ala Asn Cys Gly lie Thr Lys Lys 
260 265 270 

Val Lys Ala Thr Gly Leu Asp Ala Leu Phe Asp Ala Thr lie Lys Glu 
275 280 285 

Met Lys Glu Ala Gly Asp Asn Thr lie Val Phe Thr Asn Phe Val Asp 
290 295 300 

Phe Asp Ser Ser Trp Gly His Arg Arg Asp Val Ala Gly Tyr Ala Ala 
305 310 315 320 

Gly Leu Glu Leu Phe Asp Arg Arg Leu Pro Glu Leu Met Ser Leu Leu 
325 330 335 

Arg Asp Asp Asp lie Leu lie Leu Thr Ala Asp His Gly Cys Asp Pro 
340 345 350 

Thr Trp Thr Gly Thr Asp His Thr Arg Glu His lie Pro Val Leu Val 
355 360 365 
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Tyr Gly Pro Lys Val Lys Pro Gly Ser Leu Gly His Arg Glu Thr Phe 
370 375 380 

Ala Asp lie Gly Gin Thr Leu Ala Lys Tyr Phe Gly Thr Ser Asp Met 
385 390 395 400 

Glu Tyr Gly Lys Ala Met Phe 
405 



<210> 7 
<211> 780 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . (777) 

<400> 7 

atg act gat ctg aaa gca age age ctg cgt gca ctg aaa ttg atg gac 48 

Met Thr Asp Leu Lys Ala Ser Ser Leu Arg Ala Leu Lys Leu Met Asp 
15 10 15 

ctg aac acc ctg aat gac gac gac acc gac gag aaa gtg ate gcc ctg 96 
Leu Asn Thr Leu Asn Asp Asp Asp Thr Asp Glu Lys Val lie Ala Leu 
20 25 30 

tgt cat cag gcc aaa act ccg gtc ggc aat acc gcc get ate tgt ate 144 
Cys His Gin Ala Lys Thr Pro Val Gly Asn Thr Ala Ala lie Cys lie 
35 40 45 
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tat cct cgc ttt ate ccg att get cgc aaa act ctg aaa gag cag ggc 192 

Tyr Pro Arg Phe lie Pro lie Ala Arg Lys Thr Leu Lys Glu Gin Gly 

50 55 60 

acc ccg gaa ate cgt ate get aeg gta acc aac ttc cca cac ggt aac 240 

Thr Pro Glu lie Arg lie Ala Thr Val Thr Asn Phe Pro His Gly Asn 

65 70 75 80 

gac gac ate gac ate gcg ctg gca gaa acc cgt gcg gca ate gcc tac 288 

Asp Asp lie Asp lie Ala Leu Ala Glu Thr Arg Ala Ala lie Ala Tyr 

85 90 95 

ggt get gat gaa gtt gac gtt gtg ttc ccg tac cgc gcg ctg atg gcg 336 

Gly Ala Asp Glu Val Asp Val Val Phe Pro Tyr Arg Ala Leu Met Ala 
100 105 110 

ggt aac gag cag gtt ggt ttt gac ctg gtg aaa gcc tgt aaa gag get 3 84 

Gly Asn Glu Gin Val Gly Phe Asp Leu Val Lys Ala Cys Lys Glu Ala 

115 120 125 

tgc gcg gca gcg aat gta ctg ctg aaa gtg ate ate gaa acc ggc gaa 432 

Cys Ala Ala Ala Asn Val Leu Leu Lys Val lie lie Glu Thr Gly Glu 

130 135 140 

ctg aaa gac gaa gcg ctg ate cgt aaa gcg tct gaa ate tec ate aaa 480 

Leu Lys Asp Glu Ala Leu lie Arg Lys Ala Ser Glu lie Ser lie Lys 

145 150 155 160 

gcg ggt gcg gac ttc ate aaa acc tct acc ggt aaa gtg get gtg aac 52 8 

Ala Gly Ala Asp Phe lie Lys Thr Ser Thr Gly Lys Val Ala Val Asn 

165 170 175 
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gcg acg ccg gaa age gcg cgc ate atg atg gaa gtg ate cgt gat atg 576 
Ala Thr Pro Glu Ser Ala Arg lie Met Met Glu Val lie Arg Asp Met 
180 185 190 

ggc gta gaa aaa ace gtt ggt tte aaa ccg gcg ggc ggc gtg cgt act 624 
Gly Val Glu Lys Thr Val Gly Phe Lys Pro Ala Gly Gly Val Arg Thr 
195 200 205 

gcg gaa gat gcg cag aaa tat etc gee att gca gat gaa ctg ttc ggt 672 
Ala Glu Asp Ala Gin Lys Tyr Leu Ala lie Ala Asp Glu Leu Phe Gly 
210 215 220 

get gae tgg gea gat gcg cgt eae tac cgc ttt ggc get tec age ctg 72 0 
Ala Asp Trp Ala Asp Ala Arg His Tyr Arg Phe Gly Ala Ser Ser Leu 
225 230 235 240 

ctg gea age ctg ctg aaa gcg ctg ggt cac ggc gae ggt aag age gee 768 
Leu Ala Ser Leu Leu Lys Ala Leu Gly His Gly Asp Gly Lys Ser Ala 
245 250 255 

age age tac taa 780 
Ser Ser Tyr 



<210> 8 
<211> 259 
<212> PRT 

<213> Escherichia coli 
<400> 8 

Met Thr Asp Leu Lys Ala Ser Ser Leu Arg Ala Leu Lys Leu Met Asp 
15 10 15 
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Leu Asn Thr Leu Asn Asp Asp Asp Thr Asp Glu Lys Val lie Ala Leu 
20 25 30 

Cys His Gin Ala Lys Thr Pro Val Gly Asn Thr Ala Ala lie Cys lie 
35 40 45 

Tyr Pro Arg Phe lie Pro lie Ala Arg Lys Thr Leu Lys Glu Gin Gly 
50 55 60 

Thr Pro Glu lie Arg lie Ala Thr Val Thr Asn Phe Pro His Gly Asn 
65 70 75 80 

Asp Asp lie Asp lie Ala Leu Ala Glu Thr Arg Ala Ala lie Ala Tyr 
85 90 95 

Gly Ala Asp Glu Val Asp Val Val Phe Pro Tyr Arg Ala Leu Met Ala 
100 105 110 

Gly Asn Glu Gin Val Gly Phe Asp Leu Val Lys Ala Cys Lys Glu Ala 
115 120 125 

Cys Ala Ala Ala Asn Val Leu Leu Lys Val lie lie Glu Thr Gly Glu 
130 135 140 

Leu Lys Asp Glu Ala Leu lie Arg Lys Ala Ser Glu lie Ser lie Lys 
145 150 155 160 

Ala Gly Ala Asp Phe lie Lys Thr Ser Thr Gly Lys Val Ala Val Asn 
165 170 175 

Ala Thr Pro Glu Ser Ala Arg lie Met Met Glu Val lie Arg Asp Met 
180 185 190 
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Gly Val Glu Lys 
195 

Ala Glu Asp Ala 
210 

Ala Asp Trp Ala 
225 

Leu Ala Ser Leu 



Ser Ser Tyr 




Thr Val Gly Phe 
200 

Gin Lys Tyr Leu 
215 

Asp Ala Arg His 
230 

Leu Lys Ala Leu 
245 



Page 2 2 of 4 9 
Lys Pro Ala Gly 

Ala lie Ala Asp 
220 

Tyr Arg Phe Gly 
235 

Gly His Gly Asp 
250 




Gly Val Arg Thr 
205 

Glu Leu Phe Gly 

Ala Ser Ser Leu 
240 

Gly Lys Ser Ala 
255 



<210> 9 
<211> 1080 
<212> DNA 

<213> Escherichia coli 



<220> 

<221> CDS 

<222> (1) . . (1077) 



<400> 9 

atg tct aag att ttt gat ttc gta aaa act ggc gta ate act ggt gat 48 

Met Ser Lys lie Phe Asp Phe Val Lys Pro Gly Val lie Thr Gly Asp 
15 10 15 



gac gta cag aaa gtt ttc cag gta gca aaa gaa aac aac ttc gca ctg 96 
Asp Val Gin Lys Val Phe Gin Val Ala Lys Glu Asn Asn Phe Ala Leu 
20 25 30 
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cca gca gta aac tgc gtc ggt act gac tec ate aac gcc gta ctg gaa 144 

Pro Ala Val Asn Cys Val Gly Thr Asp Ser lie Asn Ala Val Leu Glu 
35 40 45 

acc get get aaa gtt aaa geg ecg gtt ate gtt cag ttc tec aac ggt 192 

Thr Ala Ala Lys Val Lys Ala Pro Val lie Val Gin Phe Ser Asn Gly 
50 55 60 

ggt get tee ttt ate get ggt aaa ggc gtg aaa tct gac gtt ecg cag 240 

Gly Ala Ser Phe lie Ala Gly Lys Gly Val Lys Ser Asp Val Pro Gin 
65 70 75 80 

ggt get get ate ctg ggc geg ate tct ggt geg cat cac gtt cac cag 2 88 

Gly Ala Ala lie Leu Gly Ala lie Ser Gly Ala His His Val His Gin 
85 90 95 

atg get gaa cat tat ggt gtt ecg gtt ate ctg cac act gac cac tgc 336 

Met Ala Glu His Tyr Gly Val Pro Val lie Leu His Thr Asp His Cys 
100 105 110 

geg aag aaa ctg ctg ecg tgg ate gac ggt ctg ttg gac geg ggt gaa 3 84 

Ala Lys Lys Leu Leu Pro Trp lie Asp Gly Leu Leu Asp Ala Gly Glu 
115 120 125 

aaa cac ttc gca get acc ggt aag ecg ctg ttc tct tct cac atg ate 432 

Lys His Phe Ala Ala Thr Gly Lys Pro Leu Phe Ser Ser His Met lie 
130 135 140 

gac ctg tct gaa gaa tct ctg caa gag aac ate gaa ate tgc tct aaa 480 

Asp Leu Ser Glu Glu Ser Leu Gin Glu Asn lie Glu lie Cys Ser Lys 

145 150 155 160 
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tac ctg gag cgc atg tec aaa ate ggc atg act ctg gaa ate gaa ctg 52 8 
Tyr Leu Glu Arg Met Ser Lys lie Gly Met Thr Leu Glu lie Glu Leu 
165 170 175 

9gt tge acc ggt ggt gaa gaa gac ggc gtg gac aac age cae atg gac 576 
Gly Cys Thr Gly Gly Glu Glu Asp Gly Val Asp Asn Ser His Met Asp 
180 185 190 

get tct gca ctg tac acc cag ccg gaa gac gtt gat tac gca tac acc 624 
Ala Ser Ala Leu Tyr Thr Gin Pro Glu Asp Val Asp Tyr Ala Tyr Thr 
195 200 205 

gaa ctg age aaa ate age eeg cgt tte ace ate gca gcg tec ttc ggt 672 
Glu Leu Ser Lys lie Ser Pro Arg Phe Thr lie Ala Ala Ser Phe Gly 
210 215 220 

aac gta cac ggt gtt tac aag ccg ggt aae gtg gtt ctg act eeg acc 72 0 
Asn Val His Gly Val Tyr Lys Pro Gly Asn Val Val Leu Thr Pro Thr 
225 230 235 240 

ate ctg cgt gat tct cag gaa tat gtt tec aag aaa cac aac ctg ccg 768 
lie Leu Arg Asp Ser Gin Glu Tyr Val Ser Lys Lys His Asn Leu Pro 
245 250 255 

cac aac age ctg aac ttc gta ttc cac ggt ggt tec ggt tct act get 816 
His Asn Ser Leu Asn Phe Val Phe His Gly Gly Ser Gly Ser Thr Ala 
260 265 270 

cag gaa ate aaa gac tec gta age tac ggc gta gta aaa atg aae ate 864 
Gin Glu lie Lys Asp Ser Val Ser Tyr Gly Val Val Lys Met Asn lie 
275 280 285 
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ctg aac gcg ate gac gtt ctg taa 
Leu Asn Ala lie Asp Val Leu 
355 



912 



960 
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gat acc gat acc caa tgg gca acc tgg gaa ggc gtt ctg aac tac tac 

Asp Thr Asp Thr Gin Trp Ala Thr Trp Glu Gly Val Leu Asn Tyr Tyr 
290 295 300 

aaa gcg aac gaa get tat ctg cag ggt cag ctg ggt aac ccg aaa ggc 

Lys Ala Asn Glu Ala Tyr Leu Gin Gly Gin Leu Gly Asn Pro Lys Gly 

305 310 315 320 

gaa gat cag ccg aac aag aaa tac tac gat ccg cgc gta tgg ctg cgt 

Glu Asp Gin Pro Asn Lys Lys Tyr Tyr Asp Pro Arg Val Trp Leu Arg 
325 330 335 

gcc ggt cag act teg atg ate get cgt ctg gag aaa gca ttc cag gaa 1056 
Ala Gly Gin Thr Ser Met lie Ala Arg Leu Glu Lys Ala Phe Gin Glu 

340 345 350 



1008 



1080 



<210> 10 
<211> 359 
<212> PRT 

<213> Escherichia coli 
<400> 10 

Met Ser Lys lie Phe Asp Phe Val Lys Pro Gly Val lie Thr Gly Asp 
15 10 15 

Asp Val Gin Lys Val Phe Gin Val Ala Lys Glu Asn Asn Phe Ala Leu 
20 25 30 
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Pro Ala Val Asn Cys Val Gly Thr Asp Ser lie Asn Ala Val Leu Glu 
35 40 45 

Thr Ala Ala Lys Val Lys Ala Pro Val lie Val Gin Phe Ser Asn Gly 
50 55 60 

Gly Ala Ser Phe lie Ala Gly Lys Gly Val Lys Ser Asp Val Pro Gin 
65 70 75 80 

Gly Ala Ala lie Leu Gly Ala lie Ser Gly Ala His His Val His Gin 
85 90 95 

Met Ala Glu His Tyr Gly Val Pro Val lie Leu His Thr Asp His Cys 
100 105 110 

Ala Lys Lys Leu Leu Pro Trp lie Asp Gly Leu Leu Asp Ala Gly Glu 
115 120 125 

Lys His Phe Ala Ala Thr Gly Lys Pro Leu Phe Ser Ser His Met lie 
130 135 140 

Asp Leu Ser Glu Glu Ser Leu Gin Glu Asn lie Glu lie Cys Ser Lys 
145 150 155 160 

Tyr Leu Glu Arg Met Ser Lys lie Gly Met Thr Leu Glu lie Glu Leu 
165 170 175 

Gly Cys Thr Gly Gly Glu Glu Asp Gly Val Asp Asn Ser His Met Asp 
180 185 190 

Ala Ser Ala Leu Tyr Thr Gin Pro Glu Asp Val Asp Tyr Ala Tyr Thr 
195 200 205 
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Glu Leu Ser Lys 
210 

Asn Val His Gly 
225 

lie Leu Arg Asp 

His Asn Ser Leu 
260 

Gin Glu lie Lys 
275 

Asp Thr Asp Thr 
290 

Lys Ala Asn Glu 
305 

Glu Asp Gin Pro 

Ala Gly Gin Thr 
340 

Leu Asn Ala lie 
355 




lie Ser Pro Arg 
215 

Val Tyr Lys Pro 
230 

Ser Gin Glu Tyr 
245 

Asn Phe Val Phe 

Asp Ser Val Ser 
280 

Gin Trp Ala Thr 
295 

Ala Tyr Leu Gin 
310 

Asn Lys Lys Tyr 
325 

Ser Met lie Ala 
Asp Val Leu 
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Phe Thr lie Ala 
220 

Gly Asn Val Val 
235 

Val Ser Lys Lys 
250 

His Gly Gly Ser 
265 

Tyr Gly Val Val 

Trp Glu Gly Val 
300 

Gly Gin Leu Gly 
315 

Tyr Asp Pro Arg 
330 

Arg Leu Glu Lys 
345 



Ala Ser Phe Gly 

Leu Thr Pro Thr 
240 

His Asn Leu Pro 
255 

Gly Ser Thr Ala 
270 

Lys Met Asn lie 
285 

Leu Asn Tyr Tyr 

Asn Pro Lys Gly 
320 

Val Trp Leu Arg 
335 

Ala Phe Gin Glu 
350 
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<210> 11 
<211> 921 
<212> DNA 

<213> Salmonella typhi 

<220> 

<221> CDS 

<222> (1) . . (918) 



<400> 11 

atg gat ate gcg gtt att ggc tct 
Met Asp lie Ala Val lie Gly Ser 
1 5 



aac atg gtg gac ctt ate acc tac 48 
Asn Met Val Asp Leu lie Thr Tyr 
10 15 



acc aac cag atg ccc aaa gaa ggg gaa act ctg gaa gcg ccg gcg ttt 96 
Thr Asn Gin Met Pro Lys Glu Gly Glu Thr Leu Glu Ala Pro Ala Phe 
20 25 30 



aaa ate ggc tgc ggc gga aaa ggg gcg aac cag gee gtg gcg gee get 144 
Lys lie Gly Cys Gly Gly Lys Gly Ala Asn Gin Ala Val Ala Ala Ala 
35 40 45 



aag etc aat tea aaa gta ttg atg ttg acc aaa gtg ggc gac gat att 
Lys Leu Asn Ser Lys Val Leu Met Leu Thr Lys Val Gly Asp Asp lie 
50 55 60 



192 



ttt gee gac aac acc att cgt aat etc gaa tec tgg ggg ate aat aeg 240 
Phe Ala Asp Asn Thr lie Arg Asn Leu Glu Ser Trp Gly lie Asn Thr 
65 70 75 80 



acg tat gta gaa aaa gta ccg tgt acc age age ggc gta gcg ccg att 
Thr Tyr Val Glu Lys Val Pro Cys Thr Ser Ser Gly Val Ala Pro lie 
85 90 95 



288 
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ttc gtc aac gcc aac tec age aac age att etg ate ate aaa ggc get 336 
Phe Val Asn Ala Asn Ser Ser Asn Ser lie Leu lie lie Lys Gly Ala 
100 105 110 

aae aag ttt cte teg ccg gaa gat ate gat ege geg geg gaa gat tta 3 84 
Asn Lys Phe Leu Ser Pro Glu Asp lie Asp Arg Ala Ala Glu Asp Leu 
115 120 125 

aaa aaa tgc cag ctt att gtt ctg caa etg gaa gtt cag ctt gaa acg 432 
Lys Lys Cys Gin Leu lie Val Leu Gin Leu Glu Val Gin Leu Glu Thr 
130 135 140 

gtt tat eac gca at a gaa ttt gge aag aaa eae ggg att gaa gtg tta 4 80 
Val Tyr His Ala lie Glu Phe Gly Lys Lys His Gly lie Glu Val Leu 
145 150 155 160 

tta aac ect geg eca gea tta egg gaa tta gat atg tet tat gcc tgt 52 8 
Leu Asn Pro Ala Pro Ala Leu Arg Glu Leu Asp Met Ser Tyr Ala Cys 
165 170 175 

aaa tgc gat ttc ttt gta cct aat gaa acc gag ctg gaa ata tta acc 576 
Lys Cys Asp Phe Phe Val Pro Asn Glu Thr Glu Leu Glu lie Leu Thr 
180 185 190 

ggt atg cea gtg gat ace tat gac eat att ege gea geg gca cgt teg 624 
Gly Met Pro Val Asp Thr Tyr Asp His lie Arg Ala Ala Ala Arg Ser 
195 200 205 

ctg gta gat aaa ggg etg aac aat att att gtc acc atg ggc gag aaa 672 
Leu Val Asp Lys Gly Leu Asn Asn lie lie Val Thr Met Gly Glu Lys 
210 215 220 
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ggc gcg ctg tgg atg acg cgt gac cag gaa gtc cat gtt ccg gcg ttt 72 0 
Gly Ala Leu Trp Met Thr Arg Asp Gin Glu Val His Val Pro Ala Phe 
225 230 235 240 

aga gtg aac get gtt gat acc age ggc gcg ggc gat gcc ttt ate ggc 76 8 
Arg Val Asn Ala Val Asp Thr Ser Gly Ala Gly Asp Ala Phe lie Gly 
245 250 255 

tgt tfcc gcg cat tac tac gtc cag age ggg gat gtg gaa gcc gcc atg 816 
Cys Phe Ala His Tyr Tyr Val Gin Ser Gly Asp Val Glu Ala Ala Met 
260 265 270 

aaa aaa gcc gtc etc ttt gee get ttc age gtc acc ggg aaa ggc ace 864 
Lys Lys Ala Val Leu Phe Ala Ala Phe Ser Val Thr Gly Lys Gly Thr 
275 280 285 

caa tec tct tat cea age att gag caa ttt aat gag tat ett teg ttg 912 
Gin Ser Ser Tyr Pro Ser lie Glu Gin Phe Asn Glu Tyr Leu Ser Leu 
290 295 300 

aac gaa taa 921 

Asn Glu 

305 



<210> 12 
<211> 306 
<212> PRT 

<213> Salmonella typhi 

Leu lie Thr Tyr 
15 



<400> 12 

Met Asp lie Ala Val lie Gly Ser Asn Met Val Asp 
15 10 
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Thr Asn Gin Met 
20 

Lys lie Gly Cys 
35 

Lys Leu Asn Ser 
50 

Phe Ala Asp Asn 
65 

Thr Tyr Val Glu 

Phe Val Asn Ala 
100 

Asn Lys Phe Leu 
115 

Lys Lys Cys Gin 
130 

Val Tyr His Ala 
145 

Leu Asn Pro Ala 



Lys Cys Asp Phe 
180 




Pro Lys Glu Gly 

Gly Gly Lys Gly 
40 

Lys Val Leu Met 
55 

Thr lie Arg Asn 
70 

Lys Val Pro Cys 
85 

Asn Ser Ser Asn 



Ser Pro Glu Asp 
120 

Leu lie Val Leu 
135 

lie Glu Phe Gly 
150 

Pro Ala Leu Arg 
165 

Phe Val Pro Asn 
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Glu Thr Leu Glu 
25 

Ala Asn Gin Ala 

Leu Thr Lys Val 
60 

Leu Glu Ser Trp 
75 

Thr Ser Ser Gly 
90 

Ser lie Leu lie 
105 

lie Asp Arg Ala 

Gin Leu Glu Val 
140 

Lys Lys His Gly 
155 

Glu Leu Asp Met 
170 

Glu Thr Glu Leu 
185 




Ala Pro Ala Phe 
30 

Val Ala Ala Ala 
45 

Gly Asp Asp lie 

Gly lie Asn Thr 
80 

Val Ala Pro lie 
95 

lie Lys Gly Ala 
110 

Ala Glu Asp Leu 
125 

Gin Leu Glu Thr 

lie Glu Val Leu 
160 

Ser Tyr Ala Cys 
175 

Glu lie Leu Thr 
190 
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Gly Met Pro Val 
195 

Leu Val Asp Lys 
210 

Gly Ala Leu Trp 
225 

Arg Val Asn Ala 

Cys Phe Ala His 
260 

Lys Lys Ala Val 
275 

Gin Ser Ser Tyr 
290 

Asn Glu 
305 



m 

Asp Thr Tyr Asp 
200 

Gly Leu Asn Asn 
215 

Met Thr Arg Asp 
230 

Val Asp Thr Ser 
245 

Tyr Tyr Val Gin 

Leu Phe Ala Ala 
280 

Pro Ser lie Glu 
295 
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His lie Arg Ala 

lie lie Val Thr 
220 

Gin Glu Val His 
235 

Gly Ala Gly Asp 
250 

Ser Gly Asp Val 
265 

Phe Ser Val Thr 

Gin Phe Asn Glu 
300 




Ala Ala Arg Ser 
205 

Met Gly Glu Lys 

Val Pro Ala Phe 
240 

Ala Phe lie Gly 
255 

Glu Ala Ala Met 
270 

Gly Lys Gly Thr 
285 

Tyr Leu Ser Leu 



<210> 13 
<211> 483 
<212> DMA 

<213> Lactobacillus leichmannii 

<220> 
<221> CDS 
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<222> (10) . . (480) 



<400> 13 

gtatactaa atg cca aaa aag acg ate tac ttc ggt gcc ggc tgg ttc act 51 
Met Pro Lys Lys Thr lie Tyr Phe Gly Ala Gly Trp Phe Thr 
15 10 



gac cgc caa aac aaa gcc tac aag gaa gcc atg gaa gcc etc aag gaa 
Asp Arg Gin Asn Lys Ala Tyr Lys Glu Ala Met Glu Ala Leu Lys Glu 
15 20 25 30 



99 



aac cca acg att gac ctg gaa aac age tac gtt ccc ctg gac aac cag 
Asn Pro Thr lie Asp Leu Glu Asn Ser Tyr Val Pro Leu Asp Asn Gin 
35 40 45 



147 



tac aag ggt ate egg gtt gat gaa eae eeg gaa tac ctg eat gae aag 
Tyr Lys Gly lie Arg Val Asp Glu His Pro Glu Tyr Leu His Asp Lys 
50 55 60 



195 



gtt tgg get acg gee ace tac aac aac gac ttg aac ggg ate aag ace 243 

Val Trp Ala Thr Ala Thr Tyr Asn Asn Asp Leu Asn Gly lie Lys Thr 

65 70 75 

aac gac ate atg ctg ggt gtc tac ate cct gac gaa gaa gac gtc ggc 2 91 

Asn Asp lie Met Leu Gly Val Tyr lie Pro Asp Glu Glu Asp Val Gly 

80 85 90 

cfc9 99c atg gaa ctg ggt tac gcc ttg age caa ggc aag tac gtc ett 33 9 

Leu Gly Met Glu Leu Gly Tyr Ala Leu Ser Gin Gly Lys Tyr Val Leu 

95 100 105 110 

ttg gtc ate eeg gac gaa gac tac ggc aag ccg ate aac etc atg age 3 87 

Leu Val lie Pro Asp Glu Asp Tyr Gly Lys Pro lie Asn Leu Met Ser 
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115 120 125 

tgg ggc gtc age gac aac gtg ate aag atg age eag ctg aag gac ttc 43 5 
Trp Gly Val Ser Asp Asn Val lie Lys Met Ser Gin Leu Lys Asp Phe 
130 135 140 

aac ttc aac aag ccg cgc ttc gac ttc tac gaa ggt gee gta tac taa 483 
Asn Phe Asn Lys Pro Arg Phe Asp Phe Tyr Glu Gly Ala Val Tyr 
145 150 155 

<210> 14 
<211> 157 
<212> PRT 

<213> Lactobacillus leichmannii 
<400> 14 

Met Pro Lys Lys Thr lie Tyr Phe Gly Ala Gly Trp Phe Thr Asp Arg 
15 10 15 

Gin Asn Lys Ala Tyr Lys Glu Ala Met Glu Ala Leu Lys Glu Asn Pro 
20 25 30 

Thr He Asp Leu Glu Asn Ser Tyr Val Pro Leu Asp Asn Gin Tyr Lys 
35 40 45 

Gly He Arg Val Asp Glu His Pro Glu Tyr Leu His Asp Lys Val Trp 
50 55 60 

Ala Thr Ala Thr Tyr Asn Asn Asp Leu Asn Gly He Lys Thr Asn Asp 
65 70 75 ' 80 

He Met Leu Gly Val Tyr He Pro Asp Glu Glu Asp Val Gly Leu Gly 
85 90 95 
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Met Glu Leu Gly Tyr Ala Leu Ser Gin Gly Lys Tyr Val Leu Leu Val 
100 105 110 

lie Pro Asp Glu Asp Tyr Gly Lys Pro lie Asn Leu Met Ser Trp Gly 
115 120 125 

Val Ser Asp Asn Val lie Lys Met Ser Gin Leu Lys Asp Phe Asn Phe 

130 135 140 

Asn Lys Pro Arg Phe Asp Phe Tyr Glu Gly Ala Val Tyr 
145 150 155 



<210> 15 
<211> 720 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) - - (717) 

<400> 15 

atg get acc cca cac att aat gca gaa atg ggc gat ttc get gac gta 4 8 
Met Ala Thr Pro His lie Asn Ala Glu Met Gly Asp Phe Ala Asp Val 
15 10 15 



gtt ttg atg cca ggc gac cog ctg cgt gcg aag tat att get gaa act 96 
Val Leu Met Pro Gly Asp Pro Leu Arg Ala Lys Tyr lie Ala Glu Thr 
20 25 30 
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ttc ctt gaa gat gcc cgt gaa gtg aac aac gtt cgc ggt atg ctg ggc 144 
Phe Leu Glu Asp Ala Arg Glu Val Asn Asn Val Arg Gly Met Leu Gly 
35 40 45 

ttc acc ggt act tac aaa ggc cgc aaa att tec gta atg ggt cac ggt 192 
Phe Thr Gly Thr Tyr Lys Gly Arg Lys lie Ser Val Met Gly His Gly 
50 55 60 

atg ggt ate ccg tec tge tec ate tac ace aaa gaa ctg ate ace gat 240 
Met Gly lie Pro Ser Cys Ser lie Tyr Thr Lys Glu Leu lie Thr Asp 
65 70 75 80 

ttc ggc gtg aag aaa att ate cgc gtg ggt tec tgt ggc gca gtt ctg 288 
Phe Gly Val Lys Lys lie lie Arg Val Gly Ser Cys Gly Ala Val Leu 
85 90 95 

ccg cac gta aaa ctg cgc gac gtc gtt ate ggt atg ggt acc tge acc 336 
Pro His Val Lys Leu Arg Asp Val Val lie Gly Met Gly Thr Cys Thr 
100 105 110 

gat tec aaa gtt aac cgc ate cgt ttt aaa gac cat gac ttt gee get 384 
Asp Ser Lys Val Asn Arg lie Arg Phe Lys Asp His Asp Phe Ala Ala 
115 120 125 

ate get gac ttc gac atg gtg cgt aac gca gta gat gca get aaa gca 432 
lie Ala Asp Phe Asp Met Val Arg Asn Ala Val Asp Ala Ala Lys Ala 
130 135 140 

ctg ggt att gat get cgc gtg ggt aac ctg ttc tec get gac ctg ttc 4 80 
Leu Gly lie Asp Ala Arg Val Gly Asn Leu Phe Ser Ala Asp Leu Phe 
145 150 155 160 
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Page 3 7 

tac tct ccg gac ggc gaa atg ttc gac gtg 
Tyr Ser Pro Asp Gly Glu Met Phe Asp Val 
165 170 

etc ggc gtg gaa atg gaa gcg get ggt ate 
Leu Gly Val Glu Met Glu Ala Ala Gly lie 
180 185 

ttt ggc gcg aaa gcc ctg acc ate tgc acc 
Phe Gly Ala Lys Ala Leu Thr lie Cys Thr 
195 200 

act cac gag cag acc act gcc get gag cgt 
Thr His Glu Gin Thr Thr Ala Ala Glu Arg 
210 215 

atg ate aaa ate gca ctg gaa tec gtt ctg 
Met lie Lys lie Ala Leu Glu Ser Val Leu 
225 230 




of 49 

atg gaa aaa tac ggc att 52 8 
Met Glu Lys Tyr Gly lie 
175 

tac ggc gtc get gca gaa 576 
Tyr Gly Val Ala Ala Glu 
190 

gta tct gac cac ate cgc 624 
Val Ser Asp His lie Arg 
205 

cag act acc ttc aac aac 672 
Gin Thr Thr Phe Asn Asn 
220 

ctg ggc gat aaa gag taa 72 0 

Leu Gly Asp Lys Glu 

235 



<210> 16 
<211> 239 
<212> PRT 

<213> Escherichia coli 
<400> 16 

Met Ala Thr Pro His He Asn Ala Glu Met Gly Asp Phe Ala Asp Val 
15 10 15 

Val Leu Met Pro Gly Asp Pro Leu Arg Ala Lys Tyr He Ala Glu Thr 
20 25 30 
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Phe Leu Glu Asp Ala Arg Glu Val Asn Asn Val Arg Gly Met Leu Gly 
35 40 45 

Phe Thr Gly Thr Tyr Lys Gly Arg Lys lie Ser Val Met Gly His Gly 
50 55 60 

Met Gly lie Pro Ser Cys Ser lie Tyr Thr Lys Glu Leu lie Thr Asp 
65 70 75 80 

Phe Gly Val Lys Lys lie lie Arg Val Gly Ser Cys Gly Ala Val Leu 
85 90 95 

Pro His Val Lys Leu Arg Asp Val Val lie Gly Met Gly Thr Cys Thr 
100 105 110 

Asp Ser Lys Val Asn Arg lie Arg Phe Lys Asp His Asp Phe Ala Ala 
115 120 125 

lie Ala Asp Phe Asp Met Val Arg Asn Ala Val Asp Ala Ala Lys Ala 
130 135 140 

Leu Gly lie Asp Ala Arg Val Gly Asn Leu Phe Ser Ala Asp Leu Phe 
145 150 155 160 

Tyr Ser Pro Asp Gly Glu Met Phe Asp Val Met Glu Lys Tyr Gly lie 
165 170 175 

Leu Gly Val Glu Met Glu Ala Ala Gly lie Tyr Gly Val Ala Ala Glu 
180 185 190 

Phe Gly Ala Lys Ala Leu Thr lie Cys Thr Val Ser Asp His lie Arg 
195 200 205 
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Thr His Glu Gin Thr Thr Ala Ala Glu Arg Gin Thr Thr Phe Asn Asn 
210 215 220 

Met He Lys He Ala Leu Glu Ser Val Leu Leu Gly Asp Lys Glu 
225 230 235 



<210> 17 
<211> 1224 
<212> DNA 

<213> Escherichia coli 



<220> 

<221> CDS 

<222> (1) . . (1221) 



<400> 17 

atg aaa cgt gca ttt att atg gtg 
Met Lys Arg Ala Phe He Met Val 
1 5 

aca gaa gat gca gaa cgc ttt ggt 
Thr Glu Asp Ala Glu Arg Phe Gly 
20 



ctg gac tea ttc ggc ate ggc get 4 8 
Leu Asp Ser Phe Gly He Gly Ala 
10 15 

gac gtc ggg get gac acc ctg ggt 96 
Asp Val Gly Ala Asp Thr Leu Gly 
25 30 



eat ate gca gaa get tgt gee aaa ggc gaa get gat aac ggt cgt aaa 144 
His He Ala Glu Ala Cys Ala Lys Gly Glu Ala Asp Asn Gly Arg Lys 
35 40 45 



ggc ceg etc aat ctg eea aat ctg acc cgt ctg ggg ctg gcg aaa gca 192 
Gly Pro Leu Asn Leu Pro Asn Leu Thr Arg Leu Gly Leu Ala Lys Ala 
50 55 60 
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tgc cac tct tec ggt acg gtc att ctg gat caa ctg ggc gaa gag cac 
Cys His Ser Ser Gly Thr Val lie Leu Asp Gin Leu Gly Glu Glu His 
145 150 155 160 

atg aaa acc ggc aag ccg att ttc tat acc tec get gac tec gtg ttc 
Met Lys Thr Gly Lys Pro lie Phe Tyr Thr Ser Ala Asp Ser Val Phe 
165 170 175 

cag att gee tgc cat gaa gaa act ttc ggt ctg gat aaa etc tac gaa 
Gin lie Ala Cys His Glu Glu Thr Phe Gly Leu Asp Lys Leu Tyr Glu 
180 185 190 



240 



288 



Page 40 of 4 9 

cac gaa ggt tct ace ggt ttc att ccg gcg gga atg gac ggc aac get 
His Glu Gly Ser Thr Gly Phe lie Pro Ala Gly Met Asp Gly Asn Ala 
65 70 75 80 

gaa gtt ate ggc gcg tac gca tgg gcg cac gaa atg tea tee ggt aaa 
Glu Val lie Gly Ala Tyr Ala Trp Ala His Glu Met Ser Ser Gly Lys 
85 90 95 



gat acc ccg tct ggt cac tgg gaa att gee ggc gtc ccg gtt ctg ttt 336 
Asp Thr Pro Ser Gly His Trp Glu lie Ala Gly Val Pro Val Leu Phe 
100 105 110 

gag tgg gga tat ttc tec gat cac gaa aac age ttc ccg caa gag ctg 3 84 
Glu Trp Gly Tyr Phe Ser Asp His Glu Asn Ser Phe Pro Gin Glu Leu 
115 120 125 

ctg gat aaa ctg gtc gaa cgc get aat ctg ccg ggt tac etc ggt aac 432 
Leu Asp Lys Leu Val Glu Arg Ala Asn Leu Pro Gly Tyr Leu Gly Asn 
130 135 140 



480 



528 



576 



25135563vl.w 



ttc gac tct tec tgg ggc cac cgt cgc gac gtc gcc ggt tat gcc gcg 
Phe Asp Ser Ser Trp Gly His Arg Arg Asp Val Ala Gly Tyr Ala Ala 
305 310 315 320 



768 
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ctg tgc gaa ate gcc cgt gaa gag ctg acc aac ggc ggc tac aat ate 624 

Leu Cys Glu He Ala Arg Glu Glu Leu Thr Asn Gly Gly Tyr Asn He 

195 200 205 

ggt cgt gtt ate get cgt ecg ttt ate ggc gac aaa gcc ggt aac ttc 672 
Gly Arg Val He Ala Arg Pro Phe He Gly Asp Lys Ala Gly Asn Phe 
210 215 220 

caa cgt acc ggt aac cgt cac gac ctg get gtt gag ccg cca gca ccg 72 0 
Gin Arg Thr Gly Asn Arg His Asp Leu Ala Val Glu Pro Pro Ala Pro 
225 230 235 240 

acc gtg ctg cag aaa ctg gtt gat gaa aaa cac ggc cag gtg gtt tct 
Thr Val Leu Gin Lys Leu Val Asp Glu Lys His Gly Gin Val Val Ser 
245 250 255 

gtc ggt aaa att gcg gac ate tac gcc aac tgc ggt ate acc aaa aaa 
Val Gly Lys He Ala Asp He Tyr Ala Asn Cys Gly He Thr Lys Lys 
260 265 270 

gtg aaa gcg act ggc ctg gac gcg ctg ttt gac acc acc ate aaa gag 
Val Lys Ala Thr Gly Leu Asp Ala Leu Phe Asp Thr Thr He Lys Glu 
275 280 285 

atg aaa gaa gcg ggt gat aac acc ate gtc ttc acc aac ttc gtt gac 912 
Met Lys Glu Ala Gly Asp Asn Thr He Val Phe Thr Asn Phe Val Asp 
290 295 300 



816 



864 



960 
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ggt ctg gaa ctg ttc gac cgc cgt ctg ccg gag ctg atg tct ctg ctg 
Gly Leu Glu Leu Phe Asp Arg Arg Leu Pro Glu Leu Met Ser Leu Leu 
325 330 335 



cgc gat gac gac ate ctg ate etc acc get gac cac ggt tgc gat ccg 
Arg Asp Asp Asp lie Leu lie Leu Thr Ala Asp His Gly Cys Asp Pro 
340 345 350 



acc tgg acc ggt act gac cac acg cgt gaa cac att ccg gta ctg gta 
Thr Trp Thr Gly Thr Asp His Thr Arg Glu His lie Pro Val Leu Val 
355 360 365 



tat ggc ccg aaa gta aaa ccg ggc tea ctg ggt cat cgt gaa acc ttc 
Tyr Gly Pro Lys Val Lys Pro Gly Ser Leu Gly His Arg Glu Thr Phe 
370 375 380 



gcg gat ate ggc eag act ctg gca aaa tat ttt ggt act tct gat atg 
Ala Asp lie Gly Gin Thr Leu Ala Lys Tyr Phe Gly Thr Ser Asp Met 
385 390 395 400 



gaa tat ggc aaa gcc atg ttc tga 
Glu Tyr Gly Lys Ala Met Phe 
405 



<210> 18 
<211> 407 
<212> PRT 

<213> Escherichia coli 



<400> 18 

Met Lys Arg Ala Phe lie Met Val Leu Asp Ser Phe Gly lie Gly Ala 
15 10 15 
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Thr Glu Asp Ala Glu Arg Phe Gly Asp Val Gly Ala Asp Thr Leu Gly 
20 25 30 

His lie Ala Glu Ala Cys Ala Lys Gly Glu Ala Asp Asn Gly Arg Lys 
35 40 45 

Gly Pro Leu Asn Leu Pro Asn Leu Thr Arg Leu Gly Leu Ala Lys Ala 
50 55 60 

His Glu Gly Ser Thr Gly Phe lie Pro Ala Gly Met Asp Gly Asn Ala 
65 70 75 80 

Glu Val lie Gly Ala Tyr Ala Trp Ala His Glu Met Ser Ser Gly Lys 
85 90 95 

Asp Thr Pro Ser Gly His Trp Glu He Ala Gly Val Pro Val Leu Phe 
100 105 110 

Glu Trp Gly Tyr Phe Ser Asp His Glu Asn Ser Phe Pro Gin Glu Leu 
115 120 125 

Leu Asp Lys Leu Val Glu Arg Ala Asn Leu Pro Gly Tyr Leu Gly Asn 
130 135 140 

Cys His Ser Ser Gly Thr Val He Leu Asp Gin Leu Gly Glu Glu His 
145 150 155 160 

Met Lys Thr Gly Lys Pro He Phe Tyr Thr Ser Ala Asp Ser Val Phe 
165 170 175 

Gin He Ala Cys His Glu Glu Thr Phe Gly Leu Asp Lys Leu Tyr Glu 
180 185 190 
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Leu Cys Glu He Ala Arg Glu Glu Leu Thr Asn Gly Gly Tyr Asn He 
195 200 205 



Gly Arg Val lie 
210 

Gin Arg Thr Gly 
225 

Thr Val Leu Gin 



Val Gly Lys He 
260 

Val Lys Ala Thr 
275 

Met Lys Glu Ala 
290 

Phe Asp Ser Ser 
305 

Gly Leu Glu Leu 

Arg Asp Asp Asp 
340 

Thr Trp Thr Gly 
355 



Ala Arg Pro Phe 
215 

Asn Arg His Asp 
230 

Lys Leu Val Asp 
245 

Ala Asp He Tyr 

Gly Leu Asp Ala 
280 

Gly Asp Asn Thr 
295 

Trp Gly His Arg 
310 

Phe Asp Arg Arg 
325 

He Leu He Leu 

Thr Asp His Thr 
360 



He Gly Asp Lys 
220 

Leu Ala Val Glu 
235 

Glu Lys His Gly 
250 

Ala Asn Cys Gly 
265 

Leu Phe Asp Thr 

He Val Phe Thr 
300 

Arg Asp Val Ala 
315 

Leu Pro Glu Leu 
330 

Thr Ala Asp His 
345 

Arg Glu His He 



Ala Gly Asn Phe 

Pro Pro Ala Pro 
240 

Gin Val Val Ser 
255 

He Thr Lys Lys 
270 

Thr He Lys Glu 
285 

Asn Phe Val Asp 

Gly Tyr Ala Ala 
320 

Met Ser Leu Leu 
335 

Gly Cys Asp Pro 
350 

Pro Val Leu Val 
365 
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Tyr Gly Pro Lys 
370 

Ala Asp lie Gly 
385 

Glu Tyr Gly Lys 




Val Lys Pro Gly 
375 

Gin Thr Leu Ala 
390 

Ala Met Phe 
405 
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Ser Leu Gly His 
380 

Lys Tyr Phe Gly 
395 




Arg Glu Thr Phe 

Thr Ser Asp Met 
400 



<210> 19 
<211> 780 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . (777) 



<400> 19 

atg act gat ctg aaa gca age age ctg cgt gca ctg aaa ttg atg gac 
Met Thr Asp Leu Lys Ala Ser Ser Leu Arg Ala Leu Lys Leu Met Asp 
15 10 15 



ctg aac acc ctg aat gac gac gac ace gac gag aaa gtg ate gee ctg 
Leu Asn Thr Leu Asn Asp Asp Asp Thr Asp Glu Lys Val lie Ala Leu 
20 25 30 



tgt cat cag gcc aaa act ccg gtc ggc aat acc gee get ate tgt ate 
Cys His Gin Ala Lys Thr Pro Val Gly Asn Thr Ala Ala lie Cys lie 
35 40 45 



144 
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tat cct cgc ttt ate ccg att get cgc aaa act ctg aaa gag cag ggc 192 

Tyr Pro Arg Phe lie Pro lie Ala Arg Lys Thr Leu Lys Glu Gin Gly 

50 55 60 

acc ccg gaa ate cgt ate get aeg gta ace aae ttc cea cac ggt aac 24 0 

Thr Pro Glu lie Arg lie Ala Thr Val Thr Asn Phe Pro His Gly Asn 
65 70 75 80 

gac gae ate gac ate gcg etg gca gaa acc cgt gcg gea ate gee tac 288 

Asp Asp lie Asp lie Ala Leu Ala Glu Thr Arg Ala Ala lie Ala Tyr 
85 90 95 

ggt get gat gaa gtt gae gtt gtg ttc ccg tac cgc gcg ctg atg gcg 336 

Gly Ala Asp Glu Val Asp Val Val Phe Pro Tyr Arg Ala Leu Met Ala 

100 105 110 

ggt aac gag cag gtt ggt ttt gac ctg gtg aaa gee tgt aaa gag get 3 84 

Gly Asn Glu Gin Val Gly Phe Asp Leu Val Lys Ala Cys Lys Glu Ala 

115 120 125 

tgc gcg gca gcg aat gta etg ctg aaa gtg ate ate gaa ace ggc gaa 432 

Cys Ala Ala Ala Asn Val Leu Leu Lys Val lie He Glu Thr Gly Glu 

130 135 140 

ctg aaa gae gaa gcg etg ate cgt aaa gcg tet gaa ate tec ate aaa 480 

Leu Lys Asp Glu Ala Leu He Arg Lys Ala Ser Glu He Ser He Lys 
145 150 155 160 

gcg ggt gtg gac ttc ate aaa acc tct acc ggt aaa gtg get gtg aac 52 8 

Ala Gly Val Asp Phe He Lys Thr Ser Thr Gly Lys Val Ala Val Asn 
165 170 175 
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gcg acg ccg gaa age gcg cgc ate atg atg gaa gtg ate cgt gat atg 576 
Ala Thr Pro Glu Ser Ala Arg lie Met Met Glu Val lie Arg Asp Met 
180 185 190 



ggc gta gaa aaa acc gtt ggt ttc aaa ccg gcg ggc ggc gtg cgt act 624 
Gly Val Glu Lys Thr Val Gly Phe Lys Pro Ala Gly Gly Val Arg Thr 
195 200 205 

gcg gaa gat gcg cag aaa tat etc gee att gea gat gaa ctg ttc ggt 672 
Ala Glu Asp Ala Gin Lys Tyr Leu Ala lie Ala Asp Glu Leu Phe Gly 
210 215 220 

get gae tgg gea gat gcg cgt cac tac cgc ttt ggc get tec age ctg 720 
Ala Asp Trp Ala Asp Ala Arg His Tyr Arg Phe Gly Ala Ser Ser Leu 
225 230 235 240 

ctg gea age ctg ctg aaa gcg ctg ggt cac ggc gae ggt aag age gcc 768 
Leu Ala Ser Leu Leu Lys Ala Leu Gly His Gly Asp Gly Lys Ser Ala 
245 250 255 

age age tac taa 780 
Ser Ser Tyr 



<210> 20 
<211> 259 
<212> PRT 

<213> Escherichia coli 

Met Asp 
15 



<400> 20 

Met Thr Asp Leu Lys Ala Ser Ser Leu Arg Ala Leu Lys Leu 
15 10 
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Leu Asn Thr Leu Asn Asp Asp Asp Thr Asp Glu Lys Val lie Ala Leu 
20 25 30 

Cys His Gin Ala Lys Thr Pro Val Gly Asn Thr Ala Ala lie Cys lie 
35 40 45 

Tyr Pro Arg Phe lie Pro lie Ala Arg Lys Thr Leu Lys Glu Gin Gly 
50 55 60 

Thr Pro Glu lie Arg lie Ala Thr Val Thr Asn Phe Pro His Gly Asn 
65 70 75 80 

Asp Asp lie Asp lie Ala Leu Ala Glu Thr Arg Ala Ala lie Ala Tyr 
85 90 95 

Gly Ala Asp Glu Val Asp Val Val Phe Pro Tyr Arg Ala Leu Met Ala 
100 105 110 

Gly Asn Glu Gin Val Gly Phe Asp Leu Val Lys Ala Cys Lys Glu Ala 
115 120 125 

Cys Ala Ala Ala Asn Val Leu Leu Lys Val lie lie Glu Thr Gly Glu 
130 135 140 

Leu Lys Asp Glu Ala Leu lie Arg Lys Ala Ser Glu lie Ser lie Lys 
145 150 155 160 

Ala Gly Val Asp Phe lie Lys Thr Ser Thr Gly Lys Val Ala Val Asn 
165 170 175 

Ala Thr Pro Glu Ser Ala Arg lie Met Met Glu Val lie Arg Asp Met 
180 185 190 
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Gly Val Glu Lys Thr Val Gly Phe Lys Pro Ala Gly Gly Val Arg Thr 
195 200 205 

Ala Glu Asp Ala Gin Lys Tyr Leu Ala lie Ala Asp Glu Leu Phe Gly 
210 215 220 

Ala Asp Trp Ala Asp Ala Arg His Tyr Arg Phe Gly Ala Ser Ser Leu 
225 230 235 240 

Leu Ala Ser Leu Leu Lys Ala Leu Gly His Gly Asp Gly Lys Ser Ala 
245 250 255 

Ser Ser Tyr 
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DECLARATION FOR PATETSTT APPLICATION 



As a below named iuventor, I hereby declare that: 

My resident, post office address and citizetishi!p are as stated below next to my name. 

i 

I believe I am the orxginal;, first and sole inventor (if only one name is listed below) 
or an original, first and joint inventor (if plural namesj are listed below) of the subject matter 
which is claimed and for which a patent is sought on tte invention entitled ^ Eiizyinat.i.c 
syn thesis o£ DeoKyrlbonucXeosides ^the specification of which 



( ) is attached hereto. 

^k) was filed on jtozous-b 18, 2000 



IntamationaJ. 

, as Application Serial No. k:t/epoo/ 0808 8 



I hereby state thati have reviewed and understand the contents of the above identified 
specification^ including the claims, as amended by any amendment refexred to above. 

I 

I acknowledge the duty to disclose informntion which is material to the examination 
of tbis application in accordance with Title 37, Code of Federal Regulations, 1.56(a). 

Forei^ Priority Applicatioxis 

j 

I hereby claim foreignpriorily benefits under Title 3 5, United States Co de 1 19 of any 
foreign application(s) forpatent or inventor's certificate listed below and have also identified 
below any foreign application for patent or inventor's certificate having a filing date before 
that of the application on which priority is claimed: i 

I Priority Clflimed 



99 116 425.2 



Europe 



20/Auau.^f,/1 Q^<» 



(Number) 



(Country) (Day/MjonthATear Filed) 



JTesfcJ No() 



(Number) 



(Country) 



(Day/MonthAf'ear Filed) 



JfesO No() 



I hereby claim lite benefit under Title 35, United States Code, § 119(e) of any United 
States provisional application(s) listed below. 



(Application Nitmber) (Filing Date) 

U-S. Priority ApplicatioDS 

I hereby claim the benefit under Tide 35, United States Code, §120 of any United 
States applicatiQn(5) listed below and, insofar as the subject matter of each of the claims of 
this applicationis not disclosed in the prior United States application in the manner provided 
by the first para^aph of Title 35, United States Code, §112, 1 acknowledge ihe duty to 
disclose material infotmation as defined in Title 37, Code of Federal Regulations, §1.56(a) 
which occurred between the filing date of the prior application and the national or PCX 
international filing date of this application: 



(Applic. Serial No.) (Filing Date) (Statiis-patented/pending/abandoned) 



(Applic* Serial No,) (Filing Date) (Status-patented'^pending/abandoned) 

Power of Attorney 

I hereby appoint the following attorneys and patent agents to prosecute this 
application andlransact all business in the Patent and Trademark Office connected therewith: 
Peter F- Felfe, Reg. No- 20^297,- John E. Lynch, Reg. No. 20,940; Noiroan D. Hanson, Reg- 
No, 30^946; John A. Bauer, Reg* No. 32,554; Mary Anne Schofield, Reg. No. 36,669; James 
Zubok, Reg. No. 33,671; James R. Crawford, Reg. No. 39,155. Andrew Im, Reg. No. 40,657 
and David Rubra, Reg. No. 40,314; my attorneys with ftdl power of substitution and 
revocation- 
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Address all telephone calls to ' at (212) 3 18-3000- 

Address aU correspondence to: FULBRIGHT & JAWORSKI L-L.P. 

666 FiR±L Avenue 

New York, New York 10103 

I hereby declare that all statements made herein of my own knowledge are true and 
that all statements made on information and belief are believed to be true; and further that 
these statements were made with the knowledge that willful false statements and the like so 
made are punishable by fine or imprisonment, or both, under Section 1001 of Title 13 of the 
United States Code and tijat such wiUful false statements may jeopardize the validity of the 
application or any patent issued thereon. 

FuUJsfenlgTSQie or 1* irst Inventor Signature Date 

Residence: 82380 PelQenberg, GeJTinany 



POL. 



Post Office Address: rinKenweg 5 

323 80 PeifienJoerg, Germany 

Citizenship: German - 

(Ti Hans-Georg Xhlenfeldt: ^ / ^LttjS 6^0V] '^hdin/^/' ^^ 

FuU N'SleTSecond Inventor ■ Signature "7 Date 

Residence: 823 93 xf:£eld.or£f Germany 



Post Office Address: Rabenskopf straB e 5 

823 93 Xffeldorf^ Germany 

Citisrenship : German . 



(y)^ QG-fcavlanBarzvL 1 — J_ — — 

FiiQ NamsTTEifaiSventor Signature 

F-9 216 0 Antony r France ,r^J^ 
Residence: | — ^ 

3 allee ae la Bougainvillee 

Post Office Address: ^ ' ±±± — ^ ■ 

^ £.-92160 Antony, France 

Citi2:enship: T?V^ncti 



^^^e/FnarffiTEvStSr- Signature E?ate 



Residence: F-9219 0 Meudon, Finance 



Date 



Post Office Address: 3, sPnI-.i.er des J ardieB 

F-92190 Mendon, Prance 
Citizenship: Japan , — 

F^Sl5^ Signature 
Residence: F-940on r-r-^t-,^_li. France — 

11 rue de la Fosse aux Moines 

Post Office Address; _ ' ^ 4-^. i Fr-^nce ' 

F-94000 Creteil, France 

Citizensllip: French. . _ 
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Residencai f-9 145 0 .E-fc±o lies ^ Fz-ance 



Post OfScQ Address; 



Citizeiisijdqp; French 



2, all§a^ Saxnii Mar liln 
914S0 Et±olles, France 



FvJl MSiE / .Saven til In veil teas- 
Residence; g-75 012 garia^ Pyance 



Post Office Address; 
Citizcaialiip: FjreixcJi 



5r in^e ffn.TiRpr 

75 012 Paris, Franca 



Z€oZ 



Sigoatin-e Date ^ 
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